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Abstract 

Motivated by the quest for a broader understanding of communication complexity of sim¬ 
ple functions, we introduce the class of “permutation-invariant” functions. A partial function 
/ : {0, l} n x {0,1}" —* {0,1,?} is permutation-invariant if for every bijection n : {1,...,n} — > 
{1,..., n} and every x,y e {0, l} n , it is the case that /(x,y) = f(x n ,y n ). Most of the commonly 
studied functions in communication complexity are permutation-invariant. For such functions, 
we present a simple complexity measure (computable in time polynomial in n given an implicit 
description of /) that describes their communication complexity up to polynomial factors and 
up to an additive error that is logarithmic in the input size. This gives a coarse taxonomy of the 
communication complexity of simple functions. Our work highlights the role of the well-known 
lower bounds of functions such as Set-Disjointness and Indexing, while complementing them 
with the relatively lesser-known upper bounds for Gap-Inner-Product (from the sketching liter¬ 
ature) and Sparse-Gap-Inner-Product (from the recent work of Canonne et al. [ITCS 2015]). 
We also present consequences to the study of communication complexity with imperfectly shared 
randomness where we show that for total permutation-invariant functions, imperfectly shared 
randomness results in only a polynomial blow-up in communication complexity after an additive 
O(loglogn) overhead. 
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1 Introduction 


Communication complexity, introduced by Yao [Yao79], has been a central object of study in 
complexity theory. In the two-way model, two players, Alice and Bob, are given private inputs 
x and y respectively, along with some shared randomness, and they exchange bits according 
to a predetermined protocol and produce an output. The protocol computes a function /(•,•) 
if the output equals / (x, y) with high probability over the randomness 1 . The communication 
complexity of / is the minimum over all protocols computing / of the maximum, over inputs 
x and y, of the number of bits exchanged by the protocol. The one-way communication model 
is defined similarly except that all the communication is from Alice to Bob and the output is 
produced by Bob. For an overview of communication complexity, we refer the reader to the book 
[KN97] and the survey [LS09]. 

While communication complexity of functions has been extensively studied, the focus typically 
is on lower bounds. Lower bounds on communication complexity turn into lower bounds on 
Turing machine complexity, circuit depth, data structures, streaming complexity, just to name a 
few. On the other hand, communication complexity is a very natural notion to study on its own 
merits and indeed positive results in communication complexity can probably be very useful in 
their own rights, by suggesting efficient communication mechanisms and paradigms in specific 
settings. For this perspective to be successful, it would be good to have a compact picture of 
the various communication protocols that are available, or even the ability to determine, given 
a function /, the best, or even a good, communication protocol for /. Of course such a goal is 
overly ambitious. For example, the seminal work of Karchmer and Wigderson [KW90] implies 
that finding the best protocol for / is as hard as finding the best (shallowest) circuit for some 
related function /. 

Given this general barrier, one way to make progress is to find a restrictive, but natural, 
subclass of all functions and to characterize the complexity of all functions within this class. 
Such approaches have been very successful in the context of non-deterministic computation by 
restricting to satisfiability problems [Sch78], in optimization and approximation by restricting 
to constraint satisfaction problems [Cre95, KSTWOO], in the study of decision tree complexity 
by restricting to graph properties [Ros73], or in the study of property testing by restricting to 
certain symmetric properties (see the survey [SudlO, GollO] and the references therein). In the 
above cases, the restrictions have led to characterizations (or conjectured characterizations) of 
the complexity of all functions in the restricted class. In this work, we attempt to bring in a 
similar element of unification to communication complexity. 

In this work, we introduce the class of “permutation-invariant” (total or partial) functions. Let 
[n] denote the set {1, • • • , n}. A function / : {0, l} n x {0, l} n —> {0,1, ?} is permutation invariant 
if for every bijection n : [n] —» [n] and every x,y e {0, l} n it is the case that /(x,y) = f(x n ,y n ). 
We propose to study the communication complexity of this class of functions. 

To motivate this class, we note that most of the commonly studied functions in communi¬ 
cation complexity including Equality [Yao79], (Gap) Hamming distance [Woo04, JKS08, CR12, 
Vidll, Shel2, PEG86, Yao03, HSZZ06, BBG14], (Gap) Inner Product, (Small-Set) Disjointness 
[KS92, Raz92, HW07, ST13], Small-Set Intersection [BCK + 14] are all permutation-invariant 
functions. Other functions, such as Indexing [JKS08], can be expressed without changing the in¬ 
put length significantly, as permutation-invariant functions. Permutation-invariant functions also 
include as subclasses, several classes of functions that have been well-studied in communication 
complexity, such as (AND)-symmetric functions [BdWOl, Raz03, Shell] and XOR-symmetric 
functions [ZS09]. It is worth noting that permutation-invariant functions are completely expres- 

: In this work, we also consider partial functions where /(x, y) may sometimes be undetermined, denoted /(x,y) 
In such cases, the protocol can output anything when /(x, y) is undetermined. 
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sive if one allows an exponential blow-up in input size, namely, for every function / (x, y) there 
are functions F, A, B s.t. F is permutation-invariant and /(x,y) = F(A(x),B(y)). So results on 
permutation-invariant functions that don’t depend on the input size apply to all functions. Fi¬ 
nally, we point out that permutation-invariant functions have an important standpoint among 
functions with small communication complexity, as permutation-invariance often allows the use 
of hashing/bucketing based strategies, which would allow us to get rid of the dependence of the 
communication complexity on the input length n. We also note that functions on non-Boolean do¬ 
mains that are studied in the literature on sketching such as distance estimation (given x,y e R n , 
decide if ||x — y|| p < d or if ||x — y|| p > d(l + e)) are also permutation-invariant. In particular, the 
resulting sketching/communication protocols are relevant to (some functions in) our class. 

1.1 Coarse characterization of Communication Complexity 

Permutation-invariant functions on n-bits are naturally succinctly described (by 0(n 3 ) bits). 
Given this natural description, we introduce a simple combinatorial measure m(/) (which is 
easy to compute, in particular in time poly(n) given /) which produces a coarse approximation 
of the communication complexity of /. We note that our objective is different from that of the 
standard objectives in the study of communication complexity lower bounds, where the goal is 
often to come up with a measure that has nice mathematical properties, but may actually be 
more complex to compute than communication complexity itself. In particular, this is true of the 
Information Complexity measure introduced by [CSWY01] and [BYJKS02], and used extensively 
in recent works, and which, until recently, was not even known to be approximately computable 
[BS15] (whereas communication complexity can be computed exactly, albeit in doubly exponen¬ 
tial time). Nevertheless, our work does rely on known bounds on the information complexity of 
some well-studied functions and our combinatorial measure m(/) also coarsely approximates the 
information complexity for all the functions that we study. 

To formally state our first theorem, let R(/) denote the randomized communication complex¬ 
ity of a function / and IC(/) denote its information complexity. Our result about our combinato¬ 
rial measure m(/) (see Definition 3.2) is summarized below. 

Theorem 1.1. Let f : {0,1 } n x {0, l} n —* {0,1,?} be a (total or partial) permutation-invariant 
function. Then, 

m(/) < IC(/) < R(/) < poly(m(/)) + O(logn). 

In other words, the combinatorial measure m(/) approximates communication complexity 
to within a polynomial factor, up to an additive O(logn) factor. Our result is constructive 
— given / it gives a communication protocol whose complexity is bounded from above by 
poly(m(/)) + O(logn). It would be desirable to get rid of the O(logn) factor but this seems 
hard without improving the state of the art vis-a-vis communication complexity and information 
complexity. To see this, first note that our result above also implies that information complexity 
provides a coarse approximator for communication complexity. Furthermore, any improvement 
to the additive O(logn) error in this relationship would imply improved relationship between 
information complexity and communication complexity for general functions (better than what 
is currently known). Specifically, we note: 

Proposition 1.2. Let G(-) be a function such that R (/) < poly(lC(f))+G(logn)for every permutation- 
invariant partial function f on {0,1 } n x {0, l} n . Then, for every (general) partial function g on 
{0, l} n x {0,1}/ we have R(g) < poly(IC(g)) + G(n). 

Thus, even an improvement from an additive O(logn) to additive o(logn) would imply new 
relationships between information complexity and communication complexity. 
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1.2 Communication with imperfectly shared randomness 

Next, we turn to communication complexity when the players only share randomness imperfectly, 
a model introduced by [BGI14, CGMS14]. Specifically, we consider the setting where Alice gets 
a sequence of bits r = (r 1; ... ,r t ) and Bob gets a sequence of bits s = (s 1; ...,s t ) where the pairs 
(r ; ,s ; ) are identically and independently distributed according to distribution DSBS(p), which 
means, the marginals of r* and s* are uniformly distributed in {0,1} and r, and s l are p-correlated 
(he., Pr[r ; =s ; ] = 1/2 +p/2). 

The question of what can interacting players do with such a correlation has been investi¬ 
gated in many different contexts including information theory [GK73, Wit75], probability the¬ 
ory [MO05, BM11, CMN14, MOR + 06], cryptography [BS94, Mau93, RW05] and quantum com¬ 
puting [BBP + 96]. In the context of communication complexity, however, this question has only 
been investigated recently. In particular. Bavarian et al. [BGI14] study the problem in the Simul¬ 
taneous Message Passing (SMP) model and Canonne et al. [CGMS14] study it in the standard 
one-way and two-way communication models. Let ISR p (/) denote the communication complex¬ 
ity of a function / when Alice and Bob have access to p-correlated bits. The work of [CGMS14] 
shows that for any total or partial function / with communication complexity R(/) < k it is the 
case that ISR p (/) < min{0(2 fc ), k + O(logn)}. They also give a partial function / with R(/) < k 
for which ISR p (/) = Q(2 fc ). Thus, imperfect sharing leads to an exponential slowdown for low- 
communication promise problems. 

One of the motivations of this work is to determine if the above result is tight for total func¬ 
tions. Indeed, for most of the common candidate functions with low-communication complexity 
such as Small-Set-Intersection and Small-Hamming-Distance, we show (in Section 4.1) that 
ISR p (/) < poly(R(/)). 2 This motivates us to study the question more systematically and we do 
so by considering permutation-invariant total functions. For this class, we show that the commu¬ 
nication complexity with imperfectly shared randomness is within a polynomial of the commu¬ 
nication complexity with perfectly shared randomness up to an additive O(loglog n) factor; this 
is a tighter connection than what is known for general functions. Interestingly, we achieve this 
by showing that the same combinatorial measure m(/) also coarsely captures the communica¬ 
tion complexity under imperfectly shared randomness. Once again, we note that the O(loglogn) 
factor is tight unless we can improve the upper bound of [CGMS14]. 

Theorem 1.3. Let f : {0, l} n x {0,1 } n —> {0,1} be a permutation-invariant total function. Then, 
we have 

ISRp(/) < poly(R(/)) + O(loglogn) 

Furthermore, ISRp 110 '^/) < poly(R 1 ' waj (/)) + O(loglogn). 

1.3 Overview of Proofs 

Our proof of Theorem 1.1 starts with the simple observation that for any permutation-invariant 
partial function/(•, ■), its value/(x,y) is determined completely by |x|, |y| and A(x,y) (where 
|x| denotes the Hamming weight of x and A(x,y) denotes the (non-normalized) Hamming dis¬ 
tance between x and y). By letting Alice and Bob exchange |x[ and |y[ (using O(logn) bits of 
communication), the problem now reduces to being a function only of the Hamming distance 

2 In fact, this polynomial relationship holds for a broad subclass of permutation-invariant functions that we call 
“strongly permutation-invariant”. A function /(x,y) is strongly permutation invariant if there exists A : {0, l} 2 —» {0,1} 
and symmetric function cr : {0, l} n —* {0,1} such that /(x,y) = a(h(x 1 ,y 1 ), ... ,fr(x„,y n )). Theorem 4.12 shows a 
polynomial relationship between R(/) and ISR(/) for all strongly-permutation-invariant total functions /. 
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A(x,y). To understand the remaining task, we introduce a multiparameter version of the Ham¬ 
ming distance problem GHD” b c (■, •) where GHD” b c (x,y) is undefined if |x| ^ a or |y| b or 
c — g < A(x,y) < c + g. The function is 1 if A(x,y) > c + g and 0 if A(x,y) < c — g. 

This problem turns out to have different facets for different choices of the parameters. For 
instance, if a as b & c, then the communication complexity of this problem is roughly 0((c/g) 2 ) 
and the optimal lower bound follows from the lower bound on Gap Hamming Distance [CR12, 
Vidll, Shel2] whereas the upper bound follows from simple hashing. However, when a <C b, c tv 
b and g & a different bounds and protocols kick in. In this range, the communication complexity 
turns out to be O(log(c/g)) with the upper bound coming from the protocol for Sparse-Gap- 
Inner-Product given in [CGMS14], and a lower bound that we give based on a reduction from 
Set Disjointness. In this work, we start by giving a complete picture of the complexity of GHD for 
all parameter settings. The lower bound for communication complexity, and even information 
complexity, of general permutation-invariant functions / follows immediately - we just look for 
the best choice of parameters of GHD that can be found in /. The upper bound requires more 
work in order to ensure that Alice and Bob can quickly narrow down the Hamming distance 
A(x,y) to a range where the value of / is clear. To do this, we need to verify that / does not 
change values too quickly or too often. The former follows from the fact that hard instances of 
GHD cannot be embedded in /, and the latter involves some careful accounting, leading to a full 
resolution. 

Turning to the study of communication with imperfectly shared randomness, we hit an imme¬ 
diate obstacle when extending the above strategy since Alice and Bob cannot afford to exchange 
|x| and |y| anymore, since this would involve H(logn) bits of communication and we only have 
an additional budget of O(loglogn). Instead, we undertake a partitioning of the “weight-space”, 
i.e., the set of pairs (|x|,|y|), into a finite number of regions. For most of the regions, we re¬ 
duce the communication task to one of the Small-Set-Intersection or Small-Hamming-Distance 
problems. In the former case, the sizes of the sets are polynomially related to the randomized 
communication complexity, whereas in the latter case, the Hamming distance threshold is poly¬ 
nomially related to the communication complexity. A naive conversion to protocols for the imper¬ 
fectly shared setting using the results of [CGMS14] would result in an exponential blow-up in the 
communication complexity. We give new protocols with imperfectly shared randomness for these 
two problems (which maybe viewed as extensions of protocols in [BGI14] and [CGMS14]) that 
manage to reduce the communication blow-up to just a polynomial. This manages to take care 
of most regions, but not all. To see this, note that any total function h(|x|, |y|) can be encoded 
as a permutation-invariant function /(x,y) and such functions cannot be partitioned into few 
classes. Our classification manages to eliminate all cases except such functions, and in this case, 
we apply Newman’s theorem to conclude that the randomness needed in the perfectly shared 
setting is only O(loglogn) bits (since the inputs to h are in the range [n] x [n]). Communicating 
this randomness and then executing the protocol with perfectly shared randomness gives us in 
this case a private-randomness protocol with communication R(/) + O(log log n). 

1.4 Roadmap of this paper 

In Section 2, we give some of the basic definitions and introduce the background material nec¬ 
essary for understanding the contributions of this paper. In Section 3, we introduce our measure 
m(/) and prove Theorem 1.1. In Section 4, we show the connections between communication 
complexity with imperfectly shared randomness and that with perfectly shared randomness and 
prove Theorem 1.3. We end with a summary and some future directions in Section 5. 
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2 Preliminaries 


In this section, we provide all the necessary background needed to understand the contributions 
in this paper. 

2.1 Notations and Definitions 

Throughout this paper, we will use bold letters such as x, y, etc. to denote strings in {0, l} n , where 
the i-th bit of x will be accessed as x t . We denote by |x| the Hamming weight of binary string 
x, i.e., the number of non-zero coordinates of x. We will also denote by A(x,y) the Hamming 
distance between binary strings x and y, i.e., the number of coordinates in which x and y differ. 

def 

We also denote [n] = {1, • • •, n} for every positive integer n. 

Very significant for our body of work is the definition of permutation-invariant functions, 
which we define as follows. 

Definition 2.1 (Permutation-Invariant functions). A (total or partial) function f : {0, l} n x 
{0, 1 } n —» {0,1,?} is permutation-invariant iffor all x,y e {0,1 } n and every bijection n : [n] —» [n], 
f(x n ,y n ) = /(x,y) (where x n is such that xf = x n ^). 

We note the following simple observation about permutation-invariant functions. 

Observation 2.2. Any permutation-invariant function f depends only on |xAy[, |xA-iy|, |-ixAy| 
and |->xA-iy[. Since these numbers add up to n, f really depends on any three of them, or in fact any 
three linearly independent combinations of them. Thus, we have that for some appropriate functions 
g, h, 

/(x,y) = g(|x|, |y|, |x Ay|) = fr(|x|, |y|, A(x,y)) 

We will use these 3 representations off interchangeably throughout this paper. We will often refer to 
the slices of f obtained by fixing |x| = a and |y[ = b for some a and b, in which case we will denote 
the sliced h by either h ab (-) or h(a, b, •), and similarly for g. 

2.2 Communication Complexity 

We define the standard notions of two-way (resp. one-way) randomized commmunication com¬ 
plexity 3 R(f) (resp. R 1_way (/)), that is studied under shared/public randomness model (cf. 
[KN97]). 

Definition 2.3 (Randomized communication complexity R(/)). For any function f : {0, 1 } n x 
{0,1 } n —* {0,1,?}, the randomized communication complexity R(/) is defined as the cost of the 
smallest randomized protocol, which has access to public randomness, that computes f correctly on 
any input with probability at least 2/3. In particular, 

R (/) = min CC(n) 

Vx,ye{0,l} n s . t . /(x,y)^ ?: 

Pr[n(x,y)=/(x,y)]>2/3 

where the minimum is taken over all randomized protocols n, where Alice and Bob have access to 
public randomness. 

The one-way randomized communication complexity R 1_way (/) is defined similarly, with the 
only difference being that we allow only protocols n where only Alice communicates to Bob, but not 
other way round. 

3 we will often abbreviate “communication complexity” by CC 
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Another notion of randomized communication complexity that is studied, is under private 
randomness model. The work of [CGMS14] sought out to study an intermediate model, where 
the two parties have access to i.i.d. samples from a correlated random source p(r,s), that is, 
Alice has access to r and Bob has access to s. In their work, they considered the doubly symmetric 
binary source, parametrized by p, defined as follows, 

Definition 2.4 (Doubly Symmetric Binary Source DSBS(p)). DSBS(p) is a distribution on {0,1} x 
{0,1}, such that for ( r,s ) ~ DSBS(p), 

Pr[r = 1, s = 1] = Pr[r = 0, s = 0] = (1 + p)/4 

Pr[r = 1, s = 0] = Pr[r = 0, s = 1] = (1 — p)/4 

Note that p = 1 corresponds to the standard notion of public randomness, and p = 0 corre¬ 
sponds to the standard notion of private randomness. 

Definition 2.5 (Communication complexity with imperfectly shared randomness [CGMS14]). 
For any function f : {0, l} n x {0, l} n —» {0,1,?}, the ISR-communication complexity ISR p (/) is 
defined as the cost of the smallest randomized protocol, where Alice and Bob have access to samples 
from DSBS(p), that computes f correctly on any input with probability at least 2/3. In particular, 

ISR p (/) = min CC(n) 

Vx,ye{0,l} n s . t . f ( x ,?: 

Pr[n(x,y)=/(x,y)]>2/3 

where the minimum is taken over all randomized protocols n, where Alice and Bob have access to 
samples from DSBS(p). 

def 

For ease of notation, we will often drop the subscript p and denote ISR(/) = ISR p (/). 

We use the term ISR as abbreviation for “Imperfectly Shared Randomness” and ISR-CC for 
“ISR-Communication Complexity”. To emphasize the contrast, we will use PSR and PSR-CC for 
the classical case of (perfectly) shared randomness. It is clear that if p > p', then ISR p (/) < 

ISR,,'(/)• 

An extreme case of ISR is when p = 0. This corresponds to communication complexity with 
private randomness, denoted by R pr j v (/). Note that ISR p (/) < R pr i v (/) for any p > 0. A theorem 
(due to Newman [New91]) shows that any communication protocol using public randomness can 
be simulated using only private randomness with an extra communication of additive O(logn) 
(both in the 1-way and 2-way models). We state the theorem here for the convenience of the 
reader. 

Theorem 2.6 (Newman’s theorem [New91]). For any function f : {0,1}" x {0, l} n —» Fl (any 
range FI), the following hold, 

RprivC/) < R(/) + 0(logn) 

Rj^Cf) < R 1 -^(/) + 0(logn) 
here, R pr i v (/) is also ISR 0 (/) and R(/) is also ISR^/). 

2.3 Information Complexity 

Information complexity 4 is an interactive analogue of Shannon’s information theoiy [Sha48]. 
Informally, information complexity is defined as the minimum number of bits of information that 
the two parties have to reveal to each other, when the inputs (x,y) e {0, l} n x {0, l} n are coming 
from the ‘worst’ possible distribution p. 

4 we will often abbreviate “information complexity” by IC 
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Definition 2.7 ((Prior-Free) Interactive Information Complexity; [Bral2]). For any f : {0,1 } n x 
{0,1 } n —* {0,1}, the (prior-free) interactive information complexity of f, denoted by IC(/), is 
defined as, 

IC(/) = inf sup l(X;n|Y) + /(F;n|X) 

n n 

where, the infimum is over all randomized protocols n such that for all x,y e {0,1 } n such that 
/(x, y) ?, Pr[n(x,y) /(x,y)] <1/3 and the supremum is over all distributions /r(x,y) over 
{0, l} n x {0, 1}". [I(A;B\C ) is the mutual information between A and B conditioned on C] 

We refer the reader to the survey by Weinstein [Weil 5] for a more detailed understanding of 
the definitions and the role of information complexity in communication complexity. 

A general question of interest is: what is the relationship between IC(/) and R(/)? It is 
straightforward to show R(/) > IC(/). Upper bounding R(/) as a function of IC(/) has been 
investigated in several works including the work of Barak et al. [BBCR13], The cleanest relation 
known is that R(/) < 2 0 - Ic ^^ [Bral2]. Our first result, namely Theorem 1.1, shows that for 
permutation-invariant functions, R(/) is not much larger than IC(/). 


2.4 Some Useful Communication Problems 


Central to our proof techniques is a multi-parameter version of Gap-Hamming-Distance, which 
we define as follows. 


Definition 2.8 (Gap-Hamming-Distance, GHD" b c GHD" b c g ). We define GHD^ 1 b c as the fol¬ 
lowing partial function, 


GHD 


n 

a , b , c,g 


(x,y) 


1 if |x| = a, |y| = b and A(x,y) > c + g 
■ 0 if |x| = a, |y| = b and A(x,y) < c - g 
? otherwise 

l 


Additionally we define GHD" b c as the following partial function, 


GHD 


•n 

a , b , c,g 


(x,y) 


1 if\x\ = a,\y\ = bandA(x,y) = c + g 
< 0 if |x| = a, |y| = b and A(x,y) = c - g 
? otherwise 

i 


Informally, computing GHD' 1 , is equivalent to the following problem: 

Q.,D,C,g 

• Alice is given x e {0, l} n such that |x| = a 

• Bob is given y e {0, l} 11 such that |y| = b 

• They wish to distinguish between the cases A(x,y) > c + g and A(x,y) < c — g. 

In the GHD” , problem, they wish to distinguish between the cases A(x,y) = c + g or 
A(x,y) = c — g. Arguably, GHD q b c g is an ‘easier’ problem than GHD" b c . However, in this 
work we will show that in fact GHD" , is not much harder than GHD" , 

a , b , c,g a , b , c,g 


We will use certain known lower bounds on the information complexity and one-way com¬ 
munication complexity of GHD . for some settings of the parameters. The two main settings 

U,D,C,g 

of parameters that we will be using correspond to the problems of Unique-Disjointness and 
Sparse-Indexing (a variant of the more well-known Indexing problem). 
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2.4.1 IC lower bound for Unique-Disjointness 

Definition 2.9 (Unique-Disjointness, UDisj"). UDisj" is given by the following partial function, 


UDisj" (x,y) 


( 

1 if |x| = t, |y| = t and |x A y| = 1 
< 0 if |x| = t, |y| = t and |x A y| = 0 
? otherwise 

l 


Note that UDisj" is an instance of GHD^ 12t _ 11 . 

Informally, UDisj" is the problem where the inputs x,y e {0, l} n satisfy |x| = |y| = t and Alice 
and Bob wish to decide whether |x A y| = 1 or |x A y| = 0 (promised that it is one of them is the 
case). 

Lemma 2.10. For all n e N, IC(UDisj^ n ) = Q(n). 


Proof Bar-Yossef et al. [BJKS04] proved that General-Unique-Disjointness, that is, unique 
disjointness without restrictions on |x|, |y| on inputs of length n, has information complexity 
H(n). We convert the general Unique-Disjointness instance into an instance of UDisj^ 1 by a 
simple padding argument as follows. Given an instance of general Unique-Disjointness (x',}/) e 
{0, l} n x {0, l} n . Alice constructs x = x' o li n ~l x I) o 0 ( - n+ l x D and Bob constructs y = j/ o o^ n+ l y/ ^ o 
iCn—| y l)_ Note that |xAy[ = | x 7 Ay 7 1. Also, we have that x,y e {0, l} 3n , and |x| = |y| = n. Thus, we 
have reduced General-Unique-Disjointness to UDisj^ n , and thus the lower bound of [BJKS04] 
implies that IC(UDisj^ n ) = fl(nf □ 


On top of the above lemma, we apply a simple padding argument in addition to the above 
lower bound, to get a more general lower bound for Unique-Disjointness as follows. 

Proposition 2.11 (Unique Disjointness IC Lower Bound). For all t, w e N, 

IC(UDisj^ t+w ) = 0(min{t,w}) 


Proof. We look at two cases, namely w < t and w > t. 

Case 1. [w < t]: We have from Lemma 2.10 that ICCUDisj^) > Q(w). We map the in¬ 
stance (xfy/) of UDisj^ w to an instance (x,y) of UDisj^ t+w by the following reduction, x = 
x' o iO -M d o o( t-lv ) and y = / o O^ -14 ^ o This implies that IC(UDisj^ t+w ) = fl(w). 


Case 2. [w > t]: We have from Lemma 2.10 that ICfUDisj^) = f2(t). As before, we map 
the instance (x 7 ,}/) of UDisj^ to an instance (x,y) of UDisjj t+lv by the following reduction, 
x = x' o o( w-t ) and y = 5 / o 0^ w_t \ This implies that IC(UDisjj t+w ) = L2(t). 

Combining the above two lower bounds, we get that IC(UDisj^ t+w ) = H(min { t, w}). □ 


2.4.2 1-way CC lower bound for Sparse-Indexing 


Definition 2.12 (Sparse-Indexing, SparseInd"). SparseInd" is given by the following partial func¬ 
tion, 

f 


SparseInd^(x, y) 


1 if |x| = t, |y| = 1 and \x A y| = 1 
< 0 if |x| = t, |y| = 1 and |x A y| = 0 
? otherwise 


V 


Note that SparseInd” is an instance o/GHD^ t 2 
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Informally, SparseInd" is the problem where the inputs x,y e {0,1}" satisfy |x| = t and |y| = 1 
and Alice and Bob wish to decide whether |x Ay| = 1 or |x Ay[ = 0 (promised that one of them is 
the case). 

Lemma 2.13. For all a e N, R 1 ' way (SPARSElND^ n ) = fl(n). 

Proof. Jayram et al. [JKS08] proved that if Alice is given x e {0,1 } n , Bob is given i e [u], 
and Bob needs to determine x t upon receving a single message from Alice, then Alice’s message 
should consist of fl(n) bits, even if they are allowed shared randomness. Using their result, we 
deduce that R 1 ' way (SPARSElND^ n ) = Q(n) via the following simple padding argument: Alice and 
Bob double the length of their strings from n to 2n, with Alice’s new input consisting of (x,x) 
while Bob’s new input consists of (e;,0), where x is the bitwise complement of x and e, is the 
indicator vector for location i. Note that the Hamming weight of Alice’s new string is equal to n 
while its length is 2 n, as desired. □ 

On top of the above lemma, we apply a simple padding argument in addition to the above 
lower bound, to get a more general lower bound for Sparse-Indexing as follows. 

Proposition 2.14 (Sparse-Indexing 1-way CC Lower Bound). For all t,weN, 

Ri- way (Sp ARSE I ND [ +w ) = n(min {t,w}) 

Proof. We look at two cases, namely w < t and w > t. 

Case 1. [w < t]: We have from Lemma 2.13 that R 1 ' way (SPARSElND{j / M ') > fl(vv). We map the 
instance (x/yO of SparseInd^ to an instance (x,y) of SparseInDj +w by the following reduction, 
x = x' o an( j y = y o od~ w \ This implies that R 1_way (SPARSElND^ +w ) > Q(w). 

Case 2. [w > t] : We have from Lemma 2.13 that R 1 " way (SPARSElNDj t ) = H(t). We map the 
instance (x/y 7 ) of SparseInd^ to an instance (x,y) of SparseInd[ +w by the following reduction, 
x = x' o 0 (w “° and y = y' o O^ - ^. This implies that R 1 " way (SPARSElND[ +w ) > Q(t). 

Combining the above two lower bounds, we get that R 1 ' way (SPARSElND^ +w ) > H(min {t, w}). □ 


3 Coarse Characterization of Information Complexity 

In this section, we prove the first of our results, namely Theorem 1.1, which we restate below for 
convenience of the reader. 

Theorem 1.1. Let f : {0,1 } n x {0, l} n —» {0,1,?} be a (total or partial) permutation-invariant 
function. Then, 

nr(/) < IC(/) < R(/) < poly(m(/)) + O(logn). 
where m(/) is the combinatorial measure we define in Definition 3.2. 

3.1 Overview of proof 

We construct a measure m(/) such that m(/) < IC(/) < R(/) < 0(m(/) 4 ) + O(logn). In order 
to do this, we look at the slices of / obtained by restricting |x| and |y|. As in Observation 2.2, let 
h a b (A(x,y)) be the restriction of h to |x| = a and |y| = b. We define the notion of a jump in h a b 
as follows. 
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Definition 3.1 (Jump in h ab ). (c,g) is a jump in h ab ifh ab (c + g) ^ h ab (c-g), both h aJ ,(c + g), 
h a ,b( c — &) are in { 0 , 1 } and h ab (r) is undefined for c — g < r < c + g. 

Thus, any protocol that computes / with low error will in particular be able to solve a Gap- 
Hamming-Distance problem GHD q b c as in Definition 2.8. Thus, if (c, g ) is a jump for h a b , then 

IC(GHD" b c ) is a lower bound on IC(/). We will prove lower bounds on IC(GHD” b c ) for any 
value of a, b, c and g by obtaining a variety of reductions from Unique-Disjointness, and then 
our measure m(/) will be obtained by taking the largest of these lower bounds for IC(GHD a b c ) 
over all choices of a and b and jumps (c, g) in h ab . 

Suppose m(/) = k. We construct a randomized communication protocol with cost 0(k 4 ) + 
O(logn) that computes / correctly with low constant error. The protocol works as follows: First, 
Alice and Bob exchange the values |x| and |y|, which requires O(logn) communication (say, |x| = 
a and |y| = 5). Now, all they need to figure out is the range in which A(x,y) lies (note that finding 
A(x,y) exactly can require f2(n) communication!). Let J{h ab ) = {(c 1; gj), (c 2 , g 2 ), • • • ,(c m ,g m )} 
be the set of all jumps in h ab . Note that the intervals [c ; — g ; ,Cj + g ; ] are all pairwise disjoint. 
To compute h a b (A(x,y)), it suffices for Alice and Bob need to resolve each jump, that is, for 
each i e [m], they need to figure out whether A(x,y) > c + g or A(x,y) < c — g. We will 
show that any particular jump can be resolved with a constant probability of error using 0 (k 3 ) 
communication, and the number of jumps m is at most 2°^ log n. Although the number of jumps 
is large, it suffices for Alice and Bob to do a binary search through the jumps, which will require 
them to resolve only O(kloglogn) jumps each requiring 0(k 3 ) communication. Thus, the total 
communication cost will be 0(k 4 ) + O(logn ). 5 


3.2 Proof of Theorem 1.1 


As outlined earlier, we define the measure m(/) as follows. 


Definition 3.2 (Measure m(/)). Given a permutation-invariant function f : {0,1 } n x {0,1 } n —» 
{0, 1 , ?} and integers a, b, s.t. 0 < a, b < n, let h a b : {0, • • • , n} —> {0, 1 , ?} be the function given 
by h a b (d) = /(x,y) if there exist x,y with |x| = a, |y| = b, A(x,y) = d and ? otherwise. (Note, by 
permutation invariance of f, h ab is well-defined.) Let J?{h ab ) be the set of jumps in h ab , defined 
as /oZZows, 




def 


‘ (c,g) 

i. 


h aA c - sX K,b(c + g) € {0,1} 

K,b( c - g) ^ K,b(c + g) 
Vie(c-g,c + g) : h aJj (i) = ? 


Then, we define m(f ) as follows. 


mCf) 


def 


1 

c 


max max 

a,bG[n ] 

b,g)e/(Vi>) 


min {a, b,c,n — a,n — b,n — c} 


g 


,log 


min {c, n — c} 
g 


where C is a suitably large constant (which does not depend on n). 

We will need the following lemma to show that the measure m(f ) is a lower bound on IC(/). 

5 We will need to resolve each jump correctly with error probability of at most l/f2(kloglogn). And for that we 
will actually require 0(/c 3 log k log log log n) communication. So the total communication is really 0(k 4 ■ log k ■ log log n • 
log log log n) which we write as 0(k 4 ) + O(logn) in short. See Remark 3.7. 
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Lemma 3.3. For all n, a, b, c, g for which GHD^ , is a meaningful problem 6 , the following lower 
bounds hold, 


_ n 1 mini a, b,c,n — a.n — b,n — c} 

IC(GHD abc j > -•—1- 


IC(GHD 


a,b,cj ^ c' l0g 


g 

min|c, n — c} 
g 


Next, we obtain randomized communication protocols to solve GHD" b c . 
Lemma 3.4. Let a < b < n/2. Then, the following upper bounds hold 


R(ghd; A m ) 

R ( GHD :,b,c,p 



+ log 



o 



n — c 


g 


2 


We defer the proofs of the Lemmas 3.3 and 3.4 to Section 3.3. For now, we will use these 
lemmas to prove Theorem 1.1. First, we show that each jump can be resolved using 0(m(/) 3 ) 
communication. 


Lemma 3.5. Suppose m(f) = k, and let |x| = a and |y| = b. Let (c, g) be any jump in h ab . Then 
the problem of GHD n . , that is, deciding whether A(x, y) > c + g or A(x, y) < c — g can be solved, 

Q.,D,C,g 

with a constant probability of error, using 0(k 3 ) communication. 


Proof We can assume without loss of generality that a < b <n/ 2. This is because both Alice and 
Bob can flip their respective inputs to ensure a,b < n/2. Also, if a > b, then we can flip the role 
of Alice and Bob to get a < b < n/2. 

Since a < b < n/2, from the definition of m(/) in Lemma 3.3 we have that, 

(a c n — cl 
min I -, -> < O(fc) 

U g g ) 

We consider two cases as follows. 

Case 1. (c/g) < 0{kj or ((n — c)/g) < Ojk): In this case we have, from part (ii) of Lemma 3.4, 
a randomized protocol with cost O (min j(c/g) 2 , ((n — c)/g) 2 j) = 0 (fc 2 ). 

Case 2. (a/g) < O(fc): In this case, we will show that ((a/g ) 2 (log(b/a) + log (a/g))) < 0(k 3 ). 
Clearly, (a/g ) 2 < 0(fc 2 ) and log (a/g) < O(logfc). We will now show that in fact log(b/a) < 0(k). 
From part (ii) of Lemma 3.3 we know that either log (c/g) < O(k) or log((n — c)/g) < O(k). 
Thus, it suffices to show that ( b/aj < O (min {(c/g), ((n — c)/g)}). We know that b — a+g<c< 
b+a — g. The left inequality gives (b/a) < (c+a — gj/a = (c/g).(g/a)+l —(g/a) < 0(c/g) (since 
g/a < 1). The right inequality gives ((n - bj/a) < ((n - c)/g).(g/a) + 1 - g/a < 0((n - c)/g). 
Since b < n/2, we have (b/a) < ((n — bj/a) < 0(jn — c)/g). □ 


Next, we obtain an upper bound on \jP(h a b) I, that is, the number of jumps in h ab . 

Lemma 3.6. For any function f : {0, l} n x {0, l} n —* {0,1} with mjf) = k, the number of jumps 
in h a b is at most 2 °^ logn. 

6 that is, a,b <n and c + g and c — g are achievable hamming distances when |x| = a and |y| = b. 
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Proof. Let / = {(c 1; gf), ■ ■ ■ , (c m , g m )} be the set of all jumps in h ab . Partition / into U / 2 , 
where = {(c, g) € J? : c < nj 2} and J? 2 — {(c, g) e £ : c > n/2}. From second part in Lemma 3.3 
we know the following: 

V^gje/j : log^-j<0(k) that is, g > c2“ 0(fc) 

V(c,g)e/ 2 : log^—— j<0(/c) that is, g > (n - c)2“ 0(fc) 

Let where the cfs are sorted in increasing order. We have that 

Ci+gi < Cj +1 —g i+1 for alii. That is, we have that ^(1+2“°®) < Cj +1 (l —2 _0 ^- ) ), which gives that 
c i+1 > Cj(l + 2 _0( ^). Thus, n/2 > c p > q(l + 2~°^) p , which gives that \j^ 1 \ = p < 2 0( - fc Hog n. 
Similarly, by looking at n—c/s in j$ 2 , we get that \j? 2 \ ^ 2 0( - fc - ) log n, and thus, \^\ < 2log n. □ 

We now complete the proof of Theorem 1.1. 

Proof of Theorem 1.1. Any protocol to compute / also computes GHD q b c for any a, b and 
any jump (c,g) e J?{h a b ). Consider the choice of a, b and a jump (c,g) e j{h a b ) such that the 
lower bound obtained on IC(GHD^ b c g ) through Lemma 3.3 is maximized, which by definition 
is equal to m(/) (after choosing an appropriate C in the definition of m(/)). Thus, we have 
m(/)<IC(GHD" bjCg )<IC(/). 

We also have a protocol to solve /, which works as follows: First Alice and Bob exchange |x| = a 
and |y| = b, requiring O(logn) communication. From Lemma 3.6, we know that the number of 
jumps in h a b is at most logn, and so Alice and Bob need to do a binary search through 

the jumps, resolving only O(m(/)loglogn) jumps, each with an error probability of at most 
O(l/m(/)loglogn). This can be done using 0(m(/) 3 ) communication (using Lemma 3.5). 
Thus, the total amount of communication is R(/) < 0(m(/) 4 ) + O(logn) 7 8 . All together, we have 
shown that, 

m(/) < ICC/) < R(/) < 6(m(/) 4 ) + O(logn) 

□ 

Remark 3.7. It is a valid concern that we are hiding a loglog n factor in the 0(m(/) 4 ) term. But 
this is fine because of the following reason: If m{f ) < (logn) 1 / 5 , then the O(logn) term is the 
dominating term and thus the overall communication is O(logn). And if m{f) > (logn) 1 / 5 , then 
logm(/) = fi(loglogn), in which case the 0(m(/) 4 ) is truly hiding only polylogm(/) factors. 

In the following section, we prove the main technical lemmas used, namely Lemmas 3.3 and 3.4. 


3.3 Lower and upper bounds on Gap Hamming Distance 

In this section, we prove lower bounds on information complexity (Lemma 3.3), and upper 
bounds on the randomized communication complexity (Lemma 3.4) of Gap-Hamming-Distance. 

Lower bounds 

We will prove Lemma 3.3 by getting certain reductions from Unique-Disjointness (namely Propo¬ 
sition 2.11). In order to do so, we first prove lower bounds on information complexity of two 
problems, namely, Set-Inclusion (Definition 3.8) and Sparse-Indexing (Definition 2.12). We do 
this by obtaining reductions from Unique-Disjointness. 

7 Here, 0(m(/) 3 ) = 0(m(/) 3 log m(/) log log logn). 

8 Here 0(m(/) 4 ) = 0(m(/) 4 log m(/) log logn log log logn) 
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Definition 3.8 (SetInc^ ). Let p < q < n. Alice is given x e {0, l} n such that |x| = p and Bob is 
given y e {0, l} n such that |y| = q and they wish to distinguish between the cases |x A y[ = p and 
l x A y| = p — 1. Note that SetInc” is the same as GHD^ +1 r 

Proposition 3.9 (Set-Inclusion lower bound). V t,w e N, KXSetInc 2 '^) > fl(min(t,w)) 

Proof. We know that IC(UDisj 2t+w ) > 0(min(t,w)) from Proposition 2.11. Note that UDisj 2t+w 
is same as the problem of r If we instead think of Bob’s input as complemented, we 

get that solving GHD^^^ , is equivalent to solving GHD^ t ^ w w+1 ,, which is same as SETlNC 2t t +™. 

Thus, we conclude that IC(SETlNc" t+w ) > fl(min(t,w)). □ 

Proposition 3.10 (Sparse-Indexing lower bound). VteN, ICfSPARSElND 2 ^ 1 ) > fi(f) 

Proof. We know that ICfUDisj^ 1 ) > fi(t) from Proposition 2.11. Recall that SparseInd 2 ^ 1 is an 

instance of GHD 2t 12 t 1 - Alice uses x to obtain the Hadamard code of x, which is X e {0, l } 2 
such that X(a) = a • x (for a e {0, l} t+1 ). On the other hand, Bob uses y to obtain the indicator 

otd-l 

vector of y, which is Y e {0,1} such that Y(a) = 1 iff a = y. Clearly |Y| = 1. Observe that, 

|X| = 2 t and X(y) = y • x is 1 if |x A y| = 1 and 0 if |x A y| = 0. Thus, we end up with an instance 
of SparseInd 2 [ + . Hence ^(SparseInd 2 ^ 1 ) > ICfUDisj^ 1 ) > Q(t). □ 

We now state and prove a technical lemma that will help us prove Lemma 3.3. 

Lemma 3.11. Let a < b < n/2. Then, the following lower bounds hold, 

(i) iC(GHD" Acg )> 0 (min{^,^}) 

(it) IC(GHD" b c g ) > O (min 
(Hi) IC(GHD" fe c g ) > O (min {log (^) ,log (^) }) 

Proof. We prove the three parts of the above lemma using reductions from UDisj, SetInc and 
SparseInd respectively. Note that once we fix |x| = a and |y| = b, a jump (c,g) is meaning¬ 
ful only if b — a + g < c < b + a — g (since b — a < A(x,y) < b + a). We will assume that 
c = b + a — g(mod 2), so that c + g and c — g will be achievable Hamming distances. 

Proof of (i). We obtain a reduction from UDisj 2t for which we know from Proposition 2.11 
that IC(UDisj 2t ) > H(t). Recall that UDisj 2t is same as GHD f t2t ~i v Given any instance of 

GHD^ t 2t l 1 , we first repeat the instance g times to get an instance of GHD g ^ gt g( 2 t -i) g ■ Now, 
we need to append (a — gt) l’s to x and (b — gt) l’s to y. This will increase the Hamming distance 
by a fixed amount which is at least (b — a) and at most (b + a — 2gt). Also, the number of inputs 
we need to add is at least ((a — gt) + (b — gt) + (c — g(2t — l)))/2 9 . Thus, we can get a reduction 
to GHD” . if and only if, 

Cl, 0,C,§ 

(b — a) < c — g(2t — 1 ) < b + a — 2gt 

(a-gt) + (b - gt) + (c - g(2t - 1)) 
n > 3gt + - - - 

9 We will be repeatedly using this idea in several proofs. The reason we obtain the said constraints is as follows: 
Suppose Alice has to add A l’s to her input and Bob has to add B l’s to his input. Then the hamming distance increases 
by an amount C such that \A — B\ < C < A + B. Also, the minimum number of coordinates that need to be added to 
achieve this is at least (A + B + C)/2 


a+b—c c+b—a j j 

g ’ g ) J 
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The constraints on c give us that 2 gt < c — (b — a) + g and c < b + a — g (recall that this is always 
true). The constraint on n gives that gt < n — (a + b + c + g)/2, which is equivalent to 


t < 


n — a 


n-c-g 


2 g 2 g 

Thus, we can have the reduction work by choosing t to be 
' c-b + a + g n-c - g' 


t = mm 


2g 


2 g 


= H min 


c — b + a n — c 


g 


g 


(since n> a + b) and thus we obtain 


IC(GHD a ‘ fc c ) > IC(UDisj^) > O I min 


D ,C,g 


c — b + a n 


g 


Proof of (ii). We obtain a reduction from SetInc™ w (where m = 2t + w) for which we know 
from Proposition 3.9 that IC(SetInc™ w ) > fl(min {t,w}). Recall that SetIncJ 7 ^ is same as 

1 v we first repeat the instance g times to get 


GHD 


t,t+w,w +1 


r Given an instance of GHD f t+ww+l 

■gm 


an instance of GHD*' t " t+ gw+g g - Now, we need to append (a — gt) l’s to x and (b — gt — gw) 
l’s to y. This will increase the Hamming distance by a fixed amount which is at least |b — a — gw| 
and at most (b — gt — gw) + (a — gt). Also, the number of inputs we need to add is at least 
((a — gt) + (5 — gt — gw) + (c — g(w + l)))/2. Thus, we can get a reduction to GHD q b c if and 
only if, 

| b — a — gw| < c — g(w + 1) < b + a — 2gt — gw 

(b-gt-gw) + (a-gt) + (c-gw-g) 
n > 2gt + gw + --- 

The left constraint on c requires c > max {b — a + g,2gw — (b — a) + g}. We know that c > 
b — a + g, so the only real constraint is c > 2gw — (b — a) + g, which gives us that. 


w < 


c + b - a - g 
2 g 


The right constraint on c requires c< b + a — 2gt + g, which gives us that, 

a + b-c + g 


t < 


2 g 


Suppose we choose t = a+b 0+8 . Then the constraint on n is giving us that, 

a+b+c-g a+b-c+g a+b+c-g 
n> gt -1---=---1-;- = a + b 


We already assumed that a < b < n/ 2, and hence this is always true. 

Thus, we choose t = a+b 2g c+g and w = C+t> 2g S , and invoking Proposition 3.9, we get, 


IC(GHD 


a,b,c,g 


) > IC(SETlNcf+ w ) > min({t,w}) > O ( min 


a + b — c c + b — a 


g 


ryt+1 

Proof of (iii). We obtain a reduction from SparseInd^ for which we know from Proposi 


tion 3.10 that IC(SPARSElND 2 t ) > G(t). Recall that SparseInd^ is same as GHD“ t 1 2t x , which 

_2 t+1 ot+i 

is equivalent to GHDj 2t 2 t 1 (if we flip roles of Alice and Bob). Given an instance of GHD 1 2t 2 t v 
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_g2 t+1 

we first repeat the instance g times to get an instance of GHD g t t Now, we need to append 
(a — g) l’s to x and (fa — g2 f ) l’s to y. This will increase the Hamming distance by a fixed amount 
which is at least |fa — g2 f — a + g| and at most (fa — g2 f + a — g). Also, the number of inputs 
we need to add is at least ((a — g) + (fa — g2 £ ) + (c — g2 £ ))/2. Thus, we can get a reduction to 

GHD a,b,c,g if and on ly if> 


\b ~ g2 t - a + g| < c - g2 £ < (fa - g2 £ + a - g) 
n > g2 t+1 + ■ 


t+ i , (a-g) + (b-g2 t ) + (c-g2 t ) 


The left constraint on c requires c > max {fa — a + g, 2g2 £ — fa + a — g}. Since c > b — a + g 
anyway, this only requires 2g ■ 2 C < c + fa — a + g. The right constraint on c requires c < fa + a — g 
which is also true anyway. The constraint on n is equivalent to, 

, a+b+c—g n-a—b n—c—g 

g2 l <n ---- =---+-—^ 


Thus, we choose t such that, 
t = min {log 2 


2 g 


and invoking Proposition 3.10, we get, 


c+b-a+g\ (n—c—g 
>log 2 


2 g 


c n — c 
> £1 ( min —,- 

g g 


IC(GHD a&cg ) > ^(SparseInd^ ) > Q(t) > | min-j log 2 | — J , log 2 


n — c 


g 


□ 


We are now finally able to prove Lemma 3.3. 

Proof of Lemma 3.3. Assume for now that a < fa < n/2. From parts (i) and (ii) of Lemma 3.11, 
we know the following, 

c — fa + a n — c 



> 

LI l^min 


> 

D. min 


g g 
a + b — c c + fa — a 


Adding these up, we get that, 


IC(GHD a b c g ) > SI ( min 


c-fa + a n — c 


g 


g 


g 


+ min 


g 


a + fa — c c + fa — a 


g 


g 


Since min {A,B} + min {C,D} = min{A + C,A + D,B + C,B + D}, we get that, 

(2a 2c n + a + b — 2c n + fa — a 


IC(GHD a b c > T2 ( min | , 


g 


g 


For the last two terms, note that, n + a + b — 2c > n — c (since a + fa > c) and n + fa — a > n (since 
fa > a). Thus, overall we get. 


IC(GHD a>b)C g ) > n ( min { -, -, 


a c n — c 


g g g 
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Note that this was assuming a < b <nj2. In general, we get, 


IC ( GHD a,b,c,P > « 


min {a, b,c,n — a,n — b,n — c} 


8 


[We get b, (n — b), (n — a) terms because we could have flipped inputs of either or both of Alice 
and Bob. Moreover, to get a< b, we might have flipped the role of Alice and Bob.] 


The second lower bound of IC(GHD" b c g ) > O (^log c l J j follows immediately from part 

(iii) of Lemma 3.11. 

We choose C to be a large enough constant, so that the desired lower bounds hold. □ 


Upper bounds 

We will now prove Lemma 3.4. 

Proof of Lemma 3.4. We use different protocols to prove the two parts of the lemma. 

Proof of Part 1. The main idea is similar to that of Proposition 5.7 of [CGMS14], except that we 
first hash into a small number of buckets. The details are as follows. 

Alice and Bob have an instance (x,y) of GHD n . , that is, Alice has x e {0, l} n such that 

|x| = a, and Bob has y e {0, l} n such that |y| = b, and they wish to distinguish between the cases 
A(x,y) > c + g and A(x,y) < c - g. 

Bob defines ye {1, — l} n such that y t = 1 — 2jq for every i e [n]. Then, the number of — 1 
coordinates in y is exactly equal to b. It is easy to see that (x,y) = (A(x,y) — b), and hence 
computing GHD n . (x,y) is equivalent to distinguishing between the cases (x,y) > aa and 

Q., D,C,g 

clef def 

(x,y) < /3a, where a = (c — b + g)/a and /3 = (c — b — g)/a. Note that a,/3 e [-1,4-1]. 

Alice and Bob use their shared randomness to generate a uniformly random hash function 
h : [n] —» [B] where B = f 100(a + b)(a/g) 2 . Basically, each coordinate i e [n] is mapped to one 
of the B ‘buckets’ uniformly and independently at random. Let supp(x) :={iG [n] :q = l}. We 
say that a coordinate i e supp(x) is bad if there is a coordinate j e [n], j 7 ^ i such that fr(j) = h(i) 
and at least one of Xj = 1 or y ; - = 1. For any i e supp(x), the probability that i is bad is at most 
(l — (1 — l/B)( a+i> ^ < (a + b)/B = g 2 /100a 2 . Thus, the expected number of bad coordinates 
is at most (g 2 /100a), and hence by Markov’s inequality, we have that with probability at least 
1 — g/(10a), there are at least a(l — g/(10a)) coordinates in supp(x) that are not bad. Suppose 
we have chosen an h such that at least a(l — g/( 10 a)) coordinates in supp(x) that are not bad. 

def 

Let t = (2B/a)ln(20a/g) and consider the following atomic protocol T h (x,y): 

• Alice and Bob use shared randomness to sample a sequence of l indices b 1 ,... ,b ( [B], 

• Alice picks the smallest index j e [£] such that h _ 1 (b J ) n supp(x) is non-empty and sends j 
to Bob. If there is no such j, then the protocol aborts and outputs 4-1. 

• Bob outputs n 

ieh~Hb } ) 

We first show that the difference in the probability of T h (x,y) outputting 4-1, in the two cases 
(x,y) > aa and (x,y) < [3a, is at least f 1(g/a). In particular, we will show the following, 

Pr[T h (x,y) = 4-1 | (x,y) > aa] > 1/2 + a/2 - 3g/20a (1) 

Pr[T/i(x,y) = 4-1 | (x,y) < Pa] < 1/2 + [3/2 + 3g/20a (2) 

Before we prove Inequalities 1 and 2, we first show how to use these to obtain our desired 

protocol. We observe that the difference in the two probabilities is at least (a — /3)/2 — 3g/10a > 
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Q(g/a). We repeat the above atomic procedure T times and declare the input to satisfy (x,y) > 
aa, if the number of times the output is +1, is at least ((l+a+/3)/2)T, and (x,y) < /3a otherwise. 
A Chernoff bound implies that we will have the correct value the given GHD" h c instance with 

probability at least 1 — e~ n( ~ 8 T ^ a \ Setting T = 0((a/g) 2 ) gives us that our protocol gives the 
right answer with probability at least 3/4. And the overall communication complexity of this 
protocol is 

0((a/g ) 2 log 2 W) = 0((a/g ) 2 log 2 (2 B/a log 2 (10a/g))) = 0((a/g) 2 (log(h/a) + log(a/g))) 


We now prove Inequalities 1 and 2. Note that there are three possible situations that can arise in 
T h (x,y), 

1 . the protocol aborts (that is, for all j e [£], n supp(x) = 0) 

2. the index j picked by Alice is such that 1 1 (b ; ) n supp(x)[ > 1 

3. the index j picked by Alice is such that 1 1 (b ; ) n supp(x)| = 1 

We will say that an index b e [B ] is ‘good’ if | ft -1 (b) nsupp(x)| = 1, ‘bad’ if | fr “ 1 ( b ) nsupp(x)| > 1, 
and ‘empty’ if |fr _ 1 (b) n supp(x)| = 0 

For Inequality 1, we have that (x,y) > aa, and we wish to lower bound the probability that 
the protocol T h (x,y) outputs +1. Notice that, when the protocol aborts, it always outputs +1. 
And conditioned on not aborting, the probability that we are in situation (3) and not (2), is at 
least (1 — g/10a). This is because the number of non-‘empty’ b’s is at most a, but the number of 
‘good’ b’s is at least a(l — g/lOa). Thus, overall, we get that. 


Pr[T h (x,y) = +1 | (x,y) > aa] > 

> 


> 



For Inequality 2, we have that (x,y) < /3a, and we wish to upper bound the probability 
that the protocol r h (x,y) outputs +1. The probability that we are in situation (1) is at most 

^1 — < e~ a ^ 2B = g/20a. Conditioned on not aborting, the probability 

that we are in situation (3) and not (2), is at least (1 — g/10a) as before. Thus overall, we get 
that. 


Pr[T h (x,y) = +1 | (x,y) < /3a] < 

< 




Proof of Part 2. Kushilevitz, Ostrovsky and Rabani [KOROO] gave a protocol for distinguishing 
between the cases A(x,y) > c + g and A(x,y) < c — g using 0((c/g) 2 ) communication (without 
requiring any knowledge about |x| and |y[). The upper bound of 0(((n — c)/g) 2 ) follows by Alice 
flipping her input so that the task is of distinguishing between the cases A(x,y) > n — c + g and 
A(x,y) < n — c — g, and then the upper bound of [KOROO] applies again. □ 


4 Communication with Imperfectly Shared Randomness 

In this section, we prove our second result, namely Theorem 1.3, which we restate below for 
convenience of the reader. 
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Theorem 1.3. Let f : {0, l} n x {0,1}" —» {0,1} be a permutation-invariant total function. Then, 
we have 


ISRp(/) < poly(R(/)) + O(loglogn) 
Furthermore, ISRp" av (/) < poly(R 1 ' way (/)) + O(loglogn). 


The outline of this section is as follows: In Section 4.1, we prove upper bounds on the ISR-CC 
of two basic problems: Small-Set-Intersection (in Section 4.1.1) and Small-Hamming-Distance 
(in Section 4.1.2). As an aside, in Section 4.1.3, we introduce a new class of functions, called 
strongly permutation-invariant functions, which is a generalization of both Set-Intersection and 
Hamming-Distance, and show that for every strongly permutation-invariant function / there is 
a polynomial relationship between R(/) and ISR(/) (Theorem 4.12). In Section 4.2 we give an 
overview of the proof for Theorem 1.3. The proof of the 2-way part of Theorem 1.3 appears 
in Section 4.3, and that of the 1-way part appears in Section 4.4. In Section 4.5, we prove a 
technical lemma needed in the proof of the 1-way part of Theorem 1.3. 

4.1 ISR Protocols for Basic Problems 

In this section, we prove that ISR-CC and PSR-CC are polynomially related for some specific 
functions (note that this is stronger than Theorem 1.3, in the sense that the additive O(loglogn) 
factor is not required). In particular, we give ISR protocols for two basic problems: Small-Set- 
Intersection (in Section 4.1.1) and Small-Hamming-Distance (in Section 4.1.2), such that the 
communication costs of these protocols are polynomially related to the respective PSR-CC of these 
functions. Our motivation in doing so is two-fold: firstly, to give techniques for designing efficient 
ISR protocols, and secondly because these protocols are at the heart of our proof of Theorem 1.3. 
In addition to these, we also give ISR-protocols for the class of strongly permutation-invariant 
functions which we describe in Section 4.1.3. 

4.1.1 Small Set Intersection 

The Small-Set-Intersection problem is defined as follows. 

Definition 4.1 (Small-Set-Intersection). SSI" b : {0, l} n x {0,1}" —» ZU{?} is defined as follows, 



Essentially, Alice is given x e {0,1}" such that |x| = a, Bob is given y e {0, l} n such that |y| = b, 
and they wish to compute |x A y|. 

The next two lemmas show that ISR-CC and PSR-CC are polynomially related for HD} (for 1-way 
and 2-way models respectively). 

Lemma 4.2 (1-way ISR Protocol for SSI} b ). Let n,a,b e N, such that, a,b < n/2. Then, 


n(max{a,log5}) < R^SSI},) < ISR^(SSI} b ) = 0(alog(ab)) 


Proof. We first describe the 1-way ISR-protocol for SSI} fe . 

Let x be Alice’s string and y be Bob’s string, with a = |x| and b = |y|. First, Alice and Bob 
use their correlated randomness to sample hash functions h A , h B : [n] —* {0, l} r such that for any 
i, h A (i) and h B (i) are p-correlated strings, but for i j, h A (i) and h B (j) are independent. Now, 
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Alice sends {h A (i) : x ; = 1}, which Bob sees as h 1 , h 2 ,...,h a . Then, Bob computes the size of the 
set j j e supp(y) : A(h B (j), h ; ) < | ^ for some i e [a] j and outputs it. 

By the Chernoff bound, we have that for any i, A(h A (i),h B (ij) < | ^ with probability 

1 — 2 _f 2 ( -G. Also, for any i ^ j, A(h A (i),h B (j)) > | — ^ with probability 1 — 2~^ r \ Thus, the 
probability that for every i such that x ; = y ; = 1 , A (h A (i),h B (i)) < | ^ and for every i 7 ^ j 
such that x ; = y ; - = 1, A (h A (i),h B (i)) > | ^ is at least 1 — ab2~ n ^ (by a union bound). Thus, 
with probability at least 1 — ab2~ n ^\ Bob is able to correctly determine the exact value of |xAy|. 
Choosing r = 0(log(ab)) yields a 1-way ISR protocol with 0(alog(ab)) bits of communication 
from Alice to Bob. 

The lower bound R 1 _way (SSI" b ) > H(max{a, log b }) will follow from Lemma 4.18 which we 

proved in Section 4.5, because any protocol for SSI" , can be used to compute GHD^' bcl , where 
we can choose c to be anything, in particular, we choose c ^ max {a, b}. □ 

Lemma 4.3 (2-way ISR Protocol for SSI" b ). Let n,a,b e N. Let a,b < n/2. Additionally, assume 
wlog that a < b (since the roles of Alice and Bob can be flipped). Then, 

fl(max{a,log 5}) < R(SSI" b ) < ISR p (SSI" b ) = 0(alog(ab)) 

Proof The ISR protocol is same as in proof of Lemma 4.3, with the difference that we flip the 
roles of Alice and Bob if a > b. The lower bound of R l way (SSI" b ) > fi(max{a, logb}) follows 

from Lemma 3.3 as any protocol for SSI" b also solves GHD^ b b r □ 

Remark 4.4. The protocol given in proof of Lemma 4.2 can be viewed as a generalization of the 
protocol of [BGI14] for the Equality function. More precisely, the Equality function on n bit strings 
is equivalent to the SSI^ 1 problem. This is because Alice and Bob can keep a list of all 2" elements of 
{ 0 , 1 }" (e.g., in lexicographic order) and then view their input strings as subsets of cardinality 1 of 
this list. 

We will repeatedly use the following corollary which follows from Lemma 4.2 by setting a = 1 
and b = 2 k . Note that SSI" is like the reverse direction of Sparse-Indexing, in which Alice had 
a large set and Bob had a singleton set. 

Corollary 4.5 (Reverse Sparse-Indexing). V n, k e N, ISR 1 ' way (SSI" k ) = 0(k ). 


4.1.2 Small Hamming Distance 

The Small-Hamming-Distance problem is defined as follows. 

Definition 4.6 (Small-Hamming-Distance). Let n,k& N and 0 < k < n — 1. Then HD£ : {0,1}" x 
{ 0 , 1 }" —» { 0 , 1 } is defined as follows, 


HD"(x,y) 


f 1 (f A(x,y) < k 
\0 if A(x, y) > k 


Essentially, Alice is given x e {0,1}" and Bob is given y e {0,1}" and they wish to distinguish 
between the cases A(x,y) < k and A(x,y) > k. 


The following lemma shows that ISR-CC and PSR-CC are polynomially related for HD}' (for both 
the 1-way and 2-way models). 
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Lemma 4.7 (ISR Protocol for Small-Hamming-Distance). Let n,k& N. Additionally, assume wlog 
that k < n/2 (since Bob can flip his input, and thus computing HD/ is equivalent to computing 
HD/_ fc /. Then, 

n(fc) < R(HD/) < ISR 1 ' way (HD^) < 0(k 2 ) 

In order to prove Lemma 4.7, we will use the following protocol (from [CGMS14]) twice, namely 
in the proofs of Lemmas 4.9 and 4.10. 

Lemma 4.8 (ISR protocol for Gap-Inner-Product [CGMS14]). Let -1 < s < c < 1 be real num¬ 
bers. Assume that Alice is given a vector uel" such that ||u|| 2 = 1, and that Bob is given a vector 
vel" such that ||v|| 2 = 1. Then, there is a 1-way ISR protocol that distinguishes between the cases 
<u,v) > c and (u,v) < s using 0(l/(c — s) 2 ) bits of communication from Alice to Bob. 

Lemma 4.9. Assume that k < n/ 20. Then, there is a 1-way ISR protocol that distinguishes between 
the cases A (x,y) < k and A (x,y) > n/10 using 0(1) bits of communication. 

Proof. Let Alice construct the vector u e I" by setting u, = (— l ) x '/yfn for every i e [n] and let 
Bob construct the vector v e R n by setting v, = (— l) y '/yTI for every i e [n]. Then, we have 
that ||u|| 2 = ||v|| 2 = 1. Furthermore, (u,v) = 1 — 2A(x,y)/n. Therefore, A(x,y) < k implies 
that (u,v) > 1 — 2 k/n and A(x,y) > n/10 implies that (u,v) < 4/5. Setting c := 1 — 2k/n and 
s := 4/5 and using the assumption that k < n/ 20, Lemma 4.8 yields a 1-way ISR protocol with 
0(1) bits of communication from Alice to Bob (since (c — s) > 1/10). □ 


Lemma 4.10. Assume that k < n/ 20. Then, there is a 1-way ISR protocol that distinguishes between 
the cases A(x,y) < k and k < A (x,y) < n/10 using 0(k 2 ) bits of communication. 


Proof. As in the proof of Lemma 4.9, we let Alice and Bob construct unit vectors u,v e R' 1 by 
setting u, = (—l) Xi /v^t and v ; = (—1 ) yi /yfn (respectively) for every i e [n]. Then, (u,v) = 1 — 
2A(x,y)/n. Let t := n/(10k ). Alice tensorizes her vector t times to obtain the vector u® c e M n , 

t 

namely, for every i 1 ,i 2 , ...,i t e [n], she sets u® c . . ^ = O u !j- Similarly, Bob tensorizes his 

vector t times to obtain the vector v 0t e R n \ Observe that Hu®^^ = | |u| \ [ 2 = 1, | |v 0t || 2 = | |v| = 
1 and 


(u® t ,v® t ) = (u,v) t = (1 - 2A(x,y)/n) t . 

Therefore, A(x,y) < k implies that (u^v®^ > (1 — 2 k/nf :=c, and k < A(x,y) < n/10 implies 
that (u®*^^®^ < (1 — 2(k + 1 )/n) f := s. The inner product gap c — s is at least 



> 1(1 — e - 2 /( 90 ) 


[•.• t = n/10k and k < n/ 20 ] 


> 


n 


l 

k 


where the first inequality above follows from the fact that (1 + x) r < e rx for every x,rel with 
r > 0, as well as the fact that (1 — x) r > 1 — xr for every 0 < x < 1 and r > 1. Moreover, the 
last equality above follows from the fact that (1 — e~ x )/x —» 1 as x —» 0. Therefore, applying 
Lemma 4.8 with c — s = Tt(l/k) yields a 1-way ISR protocol with 0(k 2 ) bits of communication 
from Alice to Bob. □ 
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We are now ready to prove of Lemma 4.7. 

Proof of Lemma 4.7. Assume without loss of generality that k < n/ 20, since otherwise, Alice 
can simply send her entire input (n bits) to Bob, requiring only 0(k) communication. Run the 
protocols from Lemmas 4.9 and 4.10 in sequence 1 ' and declare that A(x,y) < k if and only if 
both protocols say so; otherwise, declare that A(x,y) > k + 1. This gives a 1-way ISR protocol 
with 0(k 2 ) bits of communication from Alice to Bob. 

The lower bound R(HD£) > fi(fc) follows from Lemma 3.3 as any protocol the computes HD£ 
can be used to compute GHD^ k k 1 as well. □ 

4.1.3 Strongly Permutation-Invariant functions 

In this section, we show that the ISR-CC is polynomially related to the PSR-CC - without any 
additive dependence on n - for a natural subclass of permutation-invariant functions that we 
call “strongly permutation-invariant functions”. We point out that this section is not needed for 
proving Theorem 1.3, but we include it because it highlights some of the proof ideas that we 
eventually use. We start by defining strongly permutation-invariant functions. 

Definition 4.11 ((Total) Strongly Permutation-Invariant functions). A (total) function f : {0, l} n x 
{ 0 , 1 }" —» { 0 , 1 } is strongly permutation-invariant if there exists a symmetric function a : { 0 , l} n —» 

{ 0 , 1 } and a function h : { 0 , l } 2 —» { 0 , 1 } such that for every x,y e { 0 , l} n , 

f (x,y) = cr{h(x 1 ,y 1 ),h(x 2 ,y 2 ), - ■ ■ ,h(x n ,y n )) 

Note that strongly permutation-invariant functions include as subclasses, (AND)-symmetric func¬ 
tions (studied, e.g., by [BdWOl, Raz03, Shell]) and XOR-symmetric functions (studied, e.g., by 
[ZS09]). 

The following theorem shows that ISR-CC of any strongly permutation-invariant function is poly¬ 
nomially related to its PSR-CC, with no dependence on n. 

Theorem4.12. For any total strongly permutation-invariant function f : {0, l} n x {0, l }' 1 —»{0,1}, 
if R(/) = k then, 

ISR (/) < 0(fc 2 ) 

ISRi^Cf) < 0 (fc 3 ) 

Proof. Depending on h, any such function depends only on the sum of some subset of the quan¬ 
tities {|x Ay|, |x A — >y|, [-ix A y|, [ —>x A —iy|}. There are three main cases to consider (the remaining 
cases being similar to these three): 

(i) / depends only on |x A y| + |x A —>y|: In this case, / depends only on |x[, and hence R(/), 
ISR(/) and ISR 1_way (/) are all 1. 

(ii) / depends only on |x A y| + |-ix A —>y|: In this case, / depends only on |x © y|. Let R(/) = k, 
and suppose i is such that /(x,y) = 0 for |x © y| = i — 1, and /(x,y) = 1 for |x© y| = i + 1. 

If i < n/2 then any protocol to compute / can be used to compute GHD ( iiV Applying 
Lemma 3.3 we get that, k = R(/) > R(GHD" | ., a ) > ICtGHD^ ; j) > i/C. If i > n/2, then 
any protocol to compute / can be used to compute GHd" ( i r Applying Lemma 3.3 again, 

we get that, k = R(/) > R(GHD|' n _ i;: ) > IC(GHd| 1 (1 _ ; ; a ) > Fl(n — i ). Thus, we get that for 
any such i, it must be the case that either i < Ck or i > n — Ck. 

10 More precisely, we first repeat each of the two protocols a constant number of times and take a majority-vote of the 
outcomes. This allows us to reduce the error probability to a small enough constant and thereby apply a union bound. 
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Alice and Bob now use the 1-way ISR protocol given by Lemma 4.7 to solve HD" for every i 
such that i < Ck or i > n — Ck, and for each such problem, they repeat the protocol O(logfc) 
times to make the error probability down to 0(l/k). This yields a 1-way ISR protocol with 
0(k 3 ) bits of communication from Alice to Bob. This protocol can be modified into a 2-way 
ISR protocol with only 0(k 2 ) bits of communication by letting Alice and Bob binary-search 
over the O(k) Hamming distance problems that they need to solve, instead of solving all of 
them in parallel. 

(iii) / depends only on [xAy|: Suppose j is such that /(x,y) = 0 when |xAy| = j and /(x,y) = 1 
when |x A y| = j + 1. Then, if we restrict to only x and y such that |x| = |y| = (n + 2j)/3, 
then any protocol to compute / can be used to compute GHD a acl , where a = (n + 2j)/3 
and c = (2n — 2;)/3. Applying Lemma 3.3 we get that, k = R(/) > R(GHD” acl ) > 

IC(GHD^ a c 1 ) > 2 (n — j)/3C. This implies that j > n — 2Ck. In particular, we deduce 
that there are at most O(k) such values of j. On input pair (x,y), Alice checks whether 
[x| > n — 2Ck and Bob checks whether |y| > n — 2Ck. If one of these inequalities fail, then 
it is the case that |x A y| < n — 2Ck and the function value can be deduced. Suppose that 
|x| > n — 100 k and |y| > n — 100 k. In this case, |x| takes one of O(k) possibilities and 
hence Alice can send |x| to Bob using O(logk) bits. At this point, Bob knows both |x| and 
|y|. Thus, using the identity A(x,y) = |x| + |y| — 2|xAy|, the problem gets reduced to a 
collection 0(k)-Hamming Distance problems as in case (ii) above. The same protocols as in 
case (ii) imply a 1-way ISR protocol with 0(k 3 ) bits of communication from Alice to Bob, 
and a 2-way ISR protocol with 0(k 2 ) bits of communication. □ 

4.2 Overview of Proofs 

The proofs of the 1-way and 2-way parts of Theorem 1.3 follow the same general framework, 
which we describe next. Let (x,y) be the input pair, a := |x| and b := |y|. We partition the 
(a, b)-plane into a constant number of regions such that: 

(i) Using a small amount of communication, Alice and Bob can distinguish in which region 
their combined input lies. 

(ii) For each of these regions, there is an ISR protocol with small communication that computes 
the function value on any input in this region. 

Some of the region-specific protocols (in (ii) above) will be based on low-communication ISR 
protocols for two “atomic” problems: Small-Set-Intersection and Small-Hamming-Distance de¬ 
scribed in Section 4.1. 

We point out again that both of our protocols for Small-Set-Intersection and Small-Hamming- 
Distance have ISR-CC that is polynomial in the underlying PSR-CC, which is crucial for our pur¬ 
poses. In particular, one cannot instead use the generic exponential simulation of [CGMS14]. 

The additive O(loglogn) factor in the ISR-CC upper bound of Theorem 1.3 is due to the fact 
that for one region (other than the ones mentioned above), we show that in order for Alice 
and Bob to compute the function value / (x, y), it is enough that they compute some other low 
PSR-CC function / / (|x|, |y|) of the Hamming weights of (x,y). Since the Hamming weight of an 
n-bit string can be expressed using O(logn) bits, we have effectively reduced the “dimension” of 
the problem from n to O(logn). At this point, we can apply Newman’s theorem (Theorem 2.6) 
to obtain a private-coin protocol computing /'Qxl, |y|) (and hence /(x,y)) while increasing the 
communication cost by at most an additive O(loglogn). 
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4.3 2-way ISR Protocol for Permutation-Invariant Functions 

In this section, we prove the 2-way part of Theorem 1.3. We again use the measure m(/) (in¬ 
troduced in Definition 3.2) when restricted to total functions. For the sake of clarity, we describe 
the resulting specialized expression of m(/) again in the following proposition. 

Proposition 4.13 (Measure m(/) for total functions). Given a total permutation-invariant func¬ 
tion f : { 0 , l} n x { 0 , l} n —» { 0 , 1 }, and integers a, b, s.t. 0 < a, b < n, let h ab : { 0 , 1 , ■ • • , n} —> 
{0, 1, ?} be the function given by h a j,(d) = f(x, y) if there exist x, y with |x| = a, |y| = b, A(x,y) = d 
and ? otherwise. (Note, by permutation invariance of f, h ab is well-defined.) Let J?(h ab ) be the set 
of jumps in h ab , defined as follows, 

0(h 1 = f ir- Vb( C _ ^ h a ,b(c + 1) \ 

7 1 a ’ b) \ ‘ Kyic - g), h ab (c + g) € {0, 1} f 

Then, we define m(f ) as follows. 

m(f) = f —• max max{min {a, b, c, n — a, n — b, n — c}, log(min {c, n — c})} 

C a,be[n ] 

where C is a suitably large constant (which does not depend on n). 

We will now prove the following theorem, which immediately implies the 2-way part of Theo¬ 
rem 1.3. 

Theorem 4.14. Let f : {0,1 } n x {0,1 } n —» {0,1} be any (total) permutation-invariant function. 
Then, 

m(f) < R(/) < ISR(Z) < 0(m(/) 3 ) + R(/) + O(loglogn) 

Proof. Any protocol to compute / also computes GHO" , for any a, b and any jump c e 
Jf ih a , b ')- Consider the choice of a, b and a jump c e J(h a b ) such that the lower bound obtained 
on IC(GHD^ b 2 ) through Lemma 3.3 is maximized, which by definition is equal to m(f ) (after 
choosing an appropriate C in the definition of m(/)). Thus, we have m(/) < IC(GHD n , ) < 

CL,U,C 

IC(/)<R(/). 

Let k = f m(f). The main part of the proof is to show that ISR(/) < 0(k 3 )+R(/)+0(loglogn). 
We first divide the input space into a constant number of regions, such that Alice can send 0(log k) 
number of bits to Bob with which he can decide in which of the regions does their combined input 
lie (with high probability). Thus, once we break down the input space into these regions, it will 
suffice to give 2-way protocols with small ISR-CC for computing the function over each of these 
regions; as Alice and Bob can first determine in which region their combined input lies, and then 
run the corresponding protocol to compute the function value. 

Let a = |x| and b = |y|. We divide the (a, b)-plane into 4 regions, (I), (II), (III) and (IV), based 

on the values of a and b as follows. First let A = min {a, n — a} and B = min {b, n — b }. Then the 

regions are given by, 

(I) (A < Ck and B < 2 ck ) or (A < 2 ck and B < Ck ) 

(II) (A < Ck and B > 2 ck ) or (A > 2 ck and B < Ck ) 

(III) A > Ck and B > Ck and \A — B\ < Ck 

(IV) A > Ck and B > Ck and |A — B\ > Ck 
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Region I 

Region II 

Region III 

Region IV 


Figure 1: Division of the (a, b) plane into four regions in the 2-way setting 


where C comes from Definition 3.2. Note that regions (I), (II), (III) and (IV) form a partition of 
the (a, b)-plane. This division is shown pictorially in Figure 1. 

First, note that if |x| > n/ 2, then Alice can flip all her input bits and convey that she did so to 
Bob using one bit of communication. Similarly, if |y| > n/2, then Bob can flip all his input bits and 
convey that he did so to Alice using one bit of communication. After these flipping operations, 
Alice and Bob will look at the appropriately modified version of / based on who all flipped their 
input. Note that flipping all the bits of Alice and/or Bob preserves the permutation-invariance 
of the function. We will henceforth assume w.l.o.g. that a = |x| < n/2 and b = [y| < n/2. 
[This is also the reason that the regions described above are in terms of A = min {a, n — a} and 
B = min {b, n — b} and A,B < n/2.] 

Next, we show that determining the region to which the input pair (x,y) belongs can be done 
using O(logk) bits of (ISR) communication from Alice to Bob. First, Alice will send Bob two 
bits indicating whether a < Ck and whether a < 2 ck respectively. With this information Bob 
can determine in which of regions {(I), (II), (III) U (IV)} the combined input lies. To distinguish 
between regions (III) and (IV), Alice and Bob can first check whether \a — b\ < 100 k by setting 
up an instance of Sparse-Indexing. Namely, Alice will translate the value a into a string s a 
where s a (i) = 1 iff i = a. And Bob will translate b into a string s b such that s h (i) = 1 iff 
b — Ck < i < b + Ck. This is an instance of Sparse-Indexing which can be solved with O(logk) 
bits of ISR-CC, by Corollary 4.5. 

We now show how to compute the value of the function / in each of the 4 individual regions 
(I), (II), (III) and (IV) using ISR-CC of at most 0(k 3 ) + R(/) + O(loglogn) bits. Since / is 
a permutation-invariant function, we use the following 2 interchangeable representations of / 
using Observation 2.2, 


/(x,y) = g(|x|, |y|, |x Ay|) = h(|x|, |y|, A(x,y)) 


(I) (Main idea: Small-Set-Intersection) We have that either (a < Ck and b < 2 ck ) or (a < 2 ck 
and b < Ck). Since we can interchange the roles of Alice and Bob if required, we can 
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assume w.l.o.g. that a < Ck and b < 2 ck . In this case, Alice first sends the value of a = |x| 
to Bob. They can then apply the protocol from Lemma 4.2 in order to compute |x Ay| using 
O(alog(ab)) = 0(k 2 ) bits of 1-way ISR communication from Alice to Bob. Hence, Bob 
can determine |x A y| correctly with high probability, and hence deduce g(|x|, |y|, |x A y|) = 

/(x,y). 

(II) (Main idea: No dependence on A(x,y)) We have that either (a < Ck and b > 2 ck ) or 

(a > 2 ck and b < Ck). Since we can interchange the roles of Alice and Bob if required, we 
can assume w.l.o.g. that a < Ck and b > 2 ck . Then, the definition of the measure m(/) 
implies that for this range of values of a and b, the function h cannot depend on A(x,y) 
(because in this case A(x,y) > (b — a)). Since h depends only on |x| and |y|, Alice can 
simply send the value of a (which takes only O(log/c) bits), with which Bob can compute 
h{a, b,c ) for any valid c, that is, h a },(c) ?. 

(III) (Main idea: Small-Hamming-Distance) We have that \a - b\ < 100 k. In this case, Alice 
sends the value of a (mod 2 Ck) to Bob, requiring O(k) 1-way communication. Since Bob 
knows b, and so he can figure out the exact value of a. Next, they need to determine the 
Hamming distance A(x,y). The definition of our measure m(f) (along with the fact that 
k := m(/)) implies that if h(a, b,c — 1) ^ h(a, b,c + 1), then c must be either < Ck or 
> n — Ck. That is, given a and b, h{a, b,c ) must equal a constant for all valid c such that 
Ck < c < n — Ck. 

Since Bob knows both a and b exactly, Alice and Bob can run the 1-way ISR-protocol for 
HD” (from Lemma 4.7) for every i < Ck and i > n — Ck. This requires 0(k 3 ) communi¬ 
cation. 11 If the Hamming distance is either < Ck or > n — Ck, then Bob can determine it 
exactly and output h{a, b,c ). If the Hamming distance is not in this range, then the Ham¬ 
ming distance does not influence the value of the function, in which case Bob can output 
h{a, b,c ) for any valid c such that Ck < c < n — Ck. 

(IV) (Main idea: Newman’s theorem on (a, b)) We claim that for this range of values of a and b, 
the function h{a, b,c ) cannot depend on c. Suppose for the sake of contradiction that the 
function depends on c. Hence, there exists a value of c for which h{a, b,c— 1) ^ h(a, b, c+1). 

Since |a — b\ > Ck, we have that c > Ck. And since \n — a — b\ > Ck, we also have that 
c < min{a + b,2n — a — b} < n — Ck. Thus, we get that Ck< c < n — Ck, which contradicts 
that m(/) = k (see Proposition 4.13). 

Since /(x, y) only depends on |x| and |y| in this region, we have converted the problem 
which depended on inputs of size n, into a problem which depends on inputs of size O(logn) 
only. Since the original problem had a PSR-protocol with R(/) bits of communication, 
applying Newman’s theorem (Theorem 2.6), we conclude that, with private randomness 
itself, the problem can be solved using R(/)+0(loglogn) bits of 2-way communication. □ 

Note 4.15. We point that the proof of Theorem 4.14 shows, more strongly, that for any function 
G(.), the following two statements are equivalent: 

(1) Any total function f with R(/) = k has ISR(/) < poly(k) + G(n) 

(2) Any total permutation-invariant function f with R(/) = k has ISR (/) < poly(k)+G(0(logn)). 

Newman’s theorem (Theorem 2.6) implies that (1) holds for G(n) = O(logn), and hence we have 
that (2) also holds for G(n) = O(logn); thereby yielding the bound implied by Theorem 4.14. On the 
other hand, improving the O(loglogn) term in Theorem 4.14 to, say o(loglogn) will imply that for 
all total functions we can have G(n) in (1) to be o(logn). We note that currently such a statement 
is unknown. 

11 The ISR-protocol for HD" takes 0(i 2 ) < 0(k 2 ) communication each. But we can only afford to make error with 
probability 0(l/k), and thus for sake of amplification, the overall communication is 0(k 3 log k). 
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4.4 1-way ISR Protocol for Permutation-Invariant Functions 

In this section, we prove the 1-way part of Theorem 1.3. On a high level, the proof differs from 
that of the 2-way part in two aspects: 

1. The underlying measure that is being used. 

2. The partition of the (a, b)-plane into regions, which is no longer symmetric with respect to 
Alice and Bob as can be seen in Figure 2. 

We introduce a new measure m 1_way (/) as follows (this is the 1-way analog of Proposition 4.13). 

Definition 4.16 (Measure m 1 ' way (/) for total functions). Given a total permutation-invariant func¬ 
tion f : {0, l} n x {0, l} n —» {0,1}, and integers a, b, s.t. 0 < a, b < n, let h ab : {0,1, • • •, n} —» 
{0,1, ?} be the function given by h ab (d) = /(x, y) if there exist x, y with |x| = a, |y| = b, A(x,y) = d 
and ? otherwise. (Note, by permutation invariance of f, h a b is well-defined.) Let J?(h a b ) be the set 
of jumps in h ab , defined as follows, 

<z(h 1 = f In ■ Vb( c - ^ Vb( c + 1) \ 

^ 1 a ’ b) \ ' Kb(c - g), K !b (c + g) € {0,1}/ 

Then, we define m 1 ' way (f) as follows. 

def 1 

m 1 - way (f ) = —■ max max {min {a, c, n — a, n — c} , log (min {c, n — c})} 

C a,be[n ] 

(c.sle/CVfc) 

where C is a suitably large constant (which does not depend on n). 

We point out that the only difference between Definition 4.16 and Proposition 4.13 is that the 
term min {a, b,c,n — a,n — b,n — c} in Proposition 4.13 is replaced by min {a,c,n - a,n — c} in 
Definition 4.16. In particular. Definition 4.16 is not symmetric with respect to Alice and Bob. 

We will now prove the following theorem, which immediately implies the 1-way part of Theo¬ 
rem 1.3. 

Theorem 4.17. Let f : {0, l} n x {0, l} n —» {0,1} be any (total) permutation-invariant function. 
Then, 


m 1 - way (f) < R^(f) < ISR 1 way (/) < 0(m 1 ' way (f) 3 ) + R^(f) + O(loglogn) 


We will need the following lemma to show that the measure m 1 ' way (/) is a lower bound on 
Ri" way (/). This is analogous to Lemma 3.3, which was used to show that m(f) is a lower bound 
on R(/) and in particular, IC(/). 


Lemma 4.18. For all n, a, b, c, g for which GHD" . is a meaningful problem, the following lower 

Q.,U,C,g 

bounds hold, 


. - n 1 f min{a,n — a,c,n — c} 

Ri-^CGHD" r 


a,b,c,g J — q 


8 


, - n 1 f fmm{c,n — c} 

Ri- Way ( GHD a>fejCjg ) > - • [log [ - l -L - L 


We defer the proof of Lemma 4.18 to Section 4.5. For now, we will use this lemma to prove 
Theorem 4.17. 
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Proof. Any protocol to compute / also computes GHD” b 1 for any a, b and any jump c e 
^(ha b). Consider the choice of a, b and a jump c e c /(h a b ) such that the lower bound obtained 
on IC(GHD" bcl ) through Lemma 4.18 is maximized, which by definition is equal to m 1 ‘ way (/) 
(after choosing an appropriate C in the definition of m 1 ‘ way (/)). Thus, we have m l " way (/) < 
R l way (GHD" , ) < R^ff). 

U.,D,C,g 

Let k = f m lway (/). The main part of the proof is to show that ISR 1 ‘ way (/j < 0(/c 3 ) + R 1 ' way + 
O(loglogn). We first divide the input space into a constant number of regions, such that Alice 
can send O(logic) bits to Bob with which he can decide in which of the regions does their com¬ 
bined input lie (with high probability). Thus, once we break down the input space into these 
regions, it will suffice to give 1-way protocols with small ISR-CC for computing the function over 
each of these regions; as Alice can then send the 1-way messages corresponding to all of these 
regions, and Bob will first determine in which region their combined input lies and then use the 
corresponding messages of Alice to compute the function value. 

Suppose that we have a function / with m 1_way (/) = k. Let a = |x| and b = |y|. We 
partition the (a, b)-plane into 4 regions based on the values of a and b as follows. First let 
A = min {a,n — a} and B = min {b, n — b}. Then the regions are given by, 

(I) A< Ck and B < 2 ck 

(II) A< Ck and B > 2 ck 

(III) A> Ck and \A — B\ < Ck 

(IV) A>Ck and \A-B\> Ck 

where C comes from Definition 4.16. Note that the regions (I), (II), (III) and (IV) form a partition 
of the (a, 5) plane. This partition is shown pictorially in Figure 2. 

First, note that if |x| > n/2, then Alice can flip all her input bits and convey that she did so to 
Bob using one bit of communication. Similarly, if |y| > n/2, then Bob can flip all his input bits and 
convey that he did so to Alice using one bit of communication. After these flipping operations, 
Alice and Bob will look at the appropriately modified version of / based on who all flipped their 
input. Note that flipping all the bits of Alice and/or Bob preserves the permutation-invariance 
of the function. We will henceforth assume w.l.o.g. that a = |x[ < n/2 and b = |y| < n/2. 
[This is also the reason that the regions described above are in terms of A = min {a, n — a} and 
B = min {b,n — b} and A,B < n/2.] 

We now show that determining the region to which the input pair (x, y) belongs can be done 
using O(logk) bits of (ISR) communication from Alice to Bob. First, to distinguish between {(I), 
(II)} and {(III), (IV)}, Alice needs to send one bit to Bob indicating whether a < 100 k. Moreover, 
in the case of {(I), (II)}, Bob can easily differentiate between (I) and (II) because he knows the 
value of b. To distinguish between regions (III) and (IV), Alice and Bob can first check whether 
|a — b | < 100k by setting up an instance of Sparse-Indexing. Namely, Alice will translate the 
value a into a string s a where s a (i) = 1 iff i = a. And Bob will translate b into a string s b such 
that s b (i) = 1 iff b — Ck <i < b + Ck. This is an instance of Sparse-Indexing which can be solved 
with O(logk) bits of ISR-CC, by Corollary 4.5. 

We now show how to compute the value of the function / in each of the 4 individual regions 
(I), (II), (III) and (IV) using 1-way ISR-CC of at most 0(k 3 ) + R l way (/) + O(loglogn) bits. Since 
/ is a permutation-invariant function, we use the following 2 interchangeable representations of 
/ using Observation 2.2, 


/(x,y) = g(|x|, |y|, |x Ay|) = b(|x|, |y|, A(x,y)) 


(I) (Main idea: Small-Set-Intersection) We have that a < Ck and b < 2 ck . Alice can first 
send the value of a = |x| to Bob. They can then apply the protocol from Lemma 4.2 in order 
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Region I 

Region II 

Region III 

Region IV 


Figure 2: Division of the (a, b) plane into four regions in the 1-way setting 


to compute |xAy| using 0(alog(ab)) = 0(/c 2 ) bits of 1-way ISR communication from Alice 
to Bob. Hence, Bob can determine IxAyl correctly with high probability, and hence deduce 
g(|x|,|y|,|xAy|) = /(x,y). 

(II) (Main idea: No dependence on A(x,y)) We have that a < Ck and b > 2 ck . In this case, 
the definition of our measure m l way (/) implies that for this range of values of a and b, 
the function h cannot depend on A(x,y) (because in this case A(x,y) > (b — a)). Since h 
depends only on |x| and |y|, Alice can simply send the value of a (which takes only O(logk) 
bits), with which Bob can compute h(a, b, c) for any valid c, that is, b(a, b, c) ^ ?. 

(III) (Main idea: Small-Hamming-Distance) We have that that \a — b| < Ck. Then, Alice sends 
the value of a (mod 2 Ck) to Bob, requiring O(k ') 1-way communication. Since Bob knows b, 
he can figure out the exact value of a. Next, they need to determine the Hamming distance 
A(x,y). The definition of our measure m 1_way (/) (along with the fact that k := m 1 ‘ way (/)) 
implies that if h(a, b, c — 1) ^ b(a, b, c + 1), then c must be either < Ck or > n — Ck. That 
is, given a and b, b(a, b, c) must equal a constant for all valid c such that Ck < c < n — Ck. 

Since Bob knows both a and b exactly, Alice and Bob can run the 1-way ISR-protocol for 
HD" (from Lemma 4.7) for every i < Ck and i > n — Ck. This requires 0(k 3 ) communi¬ 
cation. 12 If the Hamming distance is either < Ck or > n — Ck, then Bob can determine it 
exactly and output h(a, b,c). If the Hamming distance is not in this range, then the Ham¬ 
ming distance does not influence the value of the function, in which case Bob can output 
h(a, b,c) for any valid c such that Ck < c < n — Ck. 

(IV) (Main idea: Newman’s theorem on (a, b)) We claim that for this range of values of a and b, 
the function h(a, b,c ) cannot depend on c. Suppose for the sake of contradiction that the 
function depends on c. Hence, there exists a value of c for which h(a, b,c— 1) ^ h(a, b, c+1). 

Since |a — b| > Ck, we have that c > Ck. And since \n — a — b\ > Ck, we also have that 

12 The ISR-protocol for HD" takes 0(i 2 ) < 0(k 2 ) communication each. But we can only afford to make error with 
probability 0(l/k), and thus for sake of amplification, the overall communication is 0(k 3 log k). 
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c < min{a + b,2n — a — b} < n — Ck. Thus, we get that Ck< c < n — Ck, which contradicts 
that m 1_way (/) = k (see Definition 4.16). 

Since /(x, y) only depends on |x| and |y| in this region, we have converted the problem 
which depended on inputs of size n, into a problem which depends on inputs of size 
O(logn) only. Since the original problem had a 1-way PSR-protocol with R 1 way (/) bits 
of communication, applying Newman’s theorem (Theorem 2.6), we conclude that, with 
private randomness itself, the problem can be solved using R l way (/) + O(loglogn) bits of 
1-way communication. □ 

Note 4.19. We point that the proof of Theorem 4.17 shows, more strongly, that for any function 
G(.), the following two statements are equivalent: 

(1) Any total function f with R 1_way (/) = k has ISR 1 ' way (/) < poly(fc) + G(n) 

(2) Any total permutation-invariant function f with R 1_lva ^(/) = k has ISR 1 ' way (/) < poly(fc) + 
G(0(logn)). 

Newman’s theorem (Theorem 2.6) implies that (1) holds for G(n) = O(logn), and hence we have 
that (2) also holds for G(n) = O(logn); thereby yielding the bound implied by Theorem 4.17. On the 
other hand, improving the O(loglogn) term in Theorem 4.17 to, say o(loglogn) will imply that for 
all total functions we can have G(n) in (1) to be o(logn). We note that currently such a statement 
is unknown. 

4.5 1-way CC lower bounds on Gap Hamming Distance 

In this section, we prove lower bounds on 1-way randomized communication complexity of Gap- 
Hamming-Distance (Lemma 4.18). 

We will prove Lemma 4.18 by getting certain reductions from Sparse-Indexing (namely Proposi¬ 
tion 2.13). Similar to our approach in Section 3.3, we first prove lower bounds on 1-way PSR-CC 
of Set-Inclusion (Definition 3.8). We do this by obtaining a reduction from Sparse-Indexing. 

Proposition 4.20 (Set-Inclusion 1-way PSR-CC lower bound). For all t, w e N, 

R 1 -^(SetInc[+^_ 1 ) > 0(min(t,w)) 

Proof. We know that R 1_way (SPARSElND^ +w ) > f2(min(t, w)) from Proposition 2.14. Note that 
SparseInd[ +w is same as the problem of GHD^ r If we instead think of Bob’s input as comple- 

-t+W 

mented, we get that solving GHD t t+w _ 1 w 1 is equivalent to solving GHD t 111 , which is same as 
SetInc^^ w _ 1 . Thus, we conclude that R 1 ' way (SETlNC^^ w _ 1 ) > n(min(t,w)). □ 

We now state and prove a technical lemma that will help us prove Lemma 4.18. Note that this is 
a 1-way analogue of Lemma 3.11. 

Lemma 4.21. Let a, b < n/2. Then, the following lower bounds hold, 

(i) R^CGHD^J > n (m in { 

(ii) Ri-- y (Giro" Acg )>D(min{^,^}) 

(Hi) R^CGHD^^) > O (min {log ( j) .log ) }) 
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Proof. We first note that part (iii) follows trivially from Lemma 3.11 because IC(/) < R(/) < 
R'- wa -v(/). But we require slightly different proofs for parts (i) and (ii). The main technical dif¬ 
ference in these proofs and the corresponding ones in Lemma 3.11 is that now we cannot assume 
that a < b < n/2. We note that this assumption of a < b was extremely crucial for Lemma 3.11, 
without which parts (i) and (ii) would not have been possible (even though they look exactly 
same in form!). The reason why we are able to succeed here without making the assumption be¬ 
cause we derive reductions from a much “simpler” problem, but which is still hard in the 1-way 
model, namely the Sparse-Indexing problem. 


Proof of (i). We obtain a reduction from SparseInd^ for which we know from Proposition 2.14 
that R 1 ‘ way (SPARSElND^ t ) = O(t). Recall that SparseInd^ is same as GHD f 11 r We repeat the 

instance g times to get an instance of GHD“ t . Now, we need to append (a - gt ) l’s to x 
and (b — g) l’s to y. This will increase the Hamming distance by a fixed amount which is at least 
|a — b — gt + g | and at most b + a — gt — g. Also, the number of inputs we need to add is at least 
((a — gt) + (6 — g) + (c — gt))/ 2. Thus, we can get a reduction to GHD . if and only if, 

\a—b — gt + g\<c — gt<b + a — gt — g 
^ 0 (a-gt) + (b-g) + (c-gt) 

The left condition on c gives us that 2 gt<c + a — b + g and b — a + g < c (which is always 
true). The right condition on c gives c < b + a — g (which is always true). The condition on n 
gives that gt<n — (a + b + c — g)/2, which is equivalent to 


t < 


n — a — b 
2g 


n-c-g 
2 g 


Thus, we can have the reduction work by choosing t to be 


t = mm 


c-b + a + g n-c-g 


2 g 


2g 


Pi min 


c — b + a n — c 


g 


g 


(since n> a + b) and thus we obtain 


R lway (GHD" bcg )> R 1 - way (SPARSElNof ) > fl ( min 


c — b + a n — c 1 A 

g g iJ 


Proof of (ii). We obtain a reduction from SetInc ^ (:+w _ 1 (where m = t + w) for which we know 
from Proposition 4.20 that R 1 ' way (SPARSElNDj n t+w _ 1 ) = Q(min {t,w}). Recall that SETlNcj 71 t+w _ 1 
is same as GHD| n t+w _ 1 w 1 . Given an instance of GHD™ t+vv _ 1 w 1 , we first repeat the instance g 
times to get an instance of GHD^ t g ^ t+w _ 1 ^ gw . Now, we need to append (a — gt) l’s to x and 
(b — gt — gw + g) l’s to y. This will increase the Hamming distance by a fixed amount which is 
at least | b — a — gw + g| and at most (b — gt — gw + g) + (a — gt). Also, the number of inputs we 
need to add is at least ((a — gt) + (b — gt — gw + g) + (c — gw))/2. Thus, we can get a reduction 
to GHD” b c if and only if, 


|b-a-gw + g|<c-gw<b + a- 2 gt - gw + g 

(b-gt-gw + g) + (a-gt) + (c-gw) 
n> gt + gw -1--- 


The left constraint on c requires c > max {b — a + g,a + 2gw — b — g}. We know that c > b — 
a + g, so the only real constraint is c > 2gw — (b — a) + g, which gives us that. 


c+b-a-g 
w < - 

2g 
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The right constraint on c requires c < b + a — 2gt + g, which gives us that, 


t < 


a + b-c + g 


Suppose we choose t to be a+b c+g . Then the constraint on n is giving us that, 

a+b+c-g a+b-c+g a+b+c-g 
n > gt H---=---1--- = a + b 


We already assumed that a, b < n/2, and hence this is always true. 

Thus, we can choose t = a+b c+s and w = c+b a g , and from Proposition 4.20, we get, 


R i-way( GHE £ b c J > Ri-w^y(SETlNcf+ w ) > min({t,w}) > fl ( min 


a + b — c c + b — a 


□ 


We are now finally able to prove Lemma 4.18. 

Proof of Lemma 4.18. Assume for now that a, b < n/2. From parts (i) and (ii) of Lemma 4.21, 
we know the following. 


Rt-way(GHD^ hcg ) > Q^min 
R^CGHD" fa c g ) > min 


c — b + a n — c 


g g 
a + b — c c + b — a 


g 


g 


Adding these up, we get that, 


R i - wa y(GHD a Acg ) >n[mm 


b + a n — c 


g 


g 


+ min 


a + b — c c + b — a 


g 


Since min {A,B} + min {C,D} = min {A + C,A + D,B + C,B + D}, we get that. 


. - n , ,2a 2c n + a + b — 2c n + b — a 

Ri-way (G HD a ) > n min —, -,- 

’” g 1 1 g g g g 


For the last two terms, note that, n + a + b — 2c>n — c (since a + b > c) and n + b — a>n — a> a 
(since n/2 > a). Thus, overall we get. 


R i-way( GHD 


a,b,c,g 


a c n — c 

) > FI min t —, —,- 

8 g g 


Note that this was assuming a,b < n/2. In general, we get. 


Rt-wayfGHD" b c g ) > O 


min {a, c,n — a,n — c} 
g 


[We get the (n — a) term because we might have flipped Alice’s input to make sure a < n/2 But 
unlike in the proof of Lemma 3.3, we don’t get b or (n — b ) in the above lower bound because 
while restricting to a, b < n/2, we never flipped the role of Alice and Bob.] 

The second part follows trivially from the corresponding part of Lemma 3.3, since IC(/) < R(/) < 
Ri-way(/) for any/. 

We choose C to be a large enough constant, so that the desired lower bounds hold. □ 
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5 Conclusion 


In this work, we initiated the study of the communication complexity of permutation-invariant 
functions. We gave a coarse characterization of their information complexity and communica¬ 
tion complexity (Theorem 1.1). We also showed for total permutation-invariant functions that 
the communication complexity with imperfectly shared randomness is not much larger than the 
communication complexity with perfectly shared randomness (Theorem 1.3). Our work points 
to several possible future directions. 

• [Generalized Permutation-Invariance] Is it possible to generalize our results for any 
larger class of functions? One candidate might be classes of functions that are invariant un¬ 
der natural subgroups of the permutation group S n , or more generally any group of actions 
on the input spaces of Alice and Bob. For example, once choice of a subgroup of permuta¬ 
tions is the set of all permutations on [n], that map [n/2] to [n/ 2] and map [n] \ [n/2] to 
[n] \ [n/2]. Or more generally, the subgroup of (-part permutation-symmetric functions, 
which consists of functions / for which there is a partition I = {l lt ... ,I t } oi [n] such that 
/ is invariant under any permutation n e S n where n{l{) = I t for every ie[l,...,(}. 

• [Permutation-Invariance over higher alphabets] Another interesting question is to gen¬ 
eralize our results to larger alphabets, i.e., to permutation-invariant functions of the form 
/ : 3C n x ty 11 —> R where 3C, W and R are not necessarily binary sets. 

• [Tight lower bounds for Gap-Hamming-Distance] What are the tightest lower bounds on 

GHD” b c for all choices of parameters, a, b, c, g ? Our lower bounds on GHD" b c are 
not tight for all choices of parameters a, b, c and g. For example, when a = b = c = n/2 
and g = y/n, our lower bound in Lemma 3.3 only implies IC(GHD' 1 . ) > Ofyfn). By 

slightly modifying the proof techniques in [CR12, Vidll, Shel2], one can obtain that 
R(GHD" b c ) = Q(n). Sherstov’s proof [Shel2] is based on the corruption bound. Combin¬ 
ing this with the recent result due to [KLL + 12] which showed (by studying a relaxed version 
of the partition bound of [JK10]) that many known lower bound methods for randomized 
communication complexity - including the corruption bound - are also lower bounds for 
the information complexity. This implies that IC(GHD] h .. ) = til(n) for a = b = c = n/2 
and g = y/n. 

• [Hierarchy within ISR] The work of [CGMS14] shows an exponential separation between 
ISR p (/) and R(/). However, it is unclear if some strong separation could be shown between 
ISR p (/) and ISR p /(/) for some function / (where p < p' < 1). 

• [Limits of separation in [CGMS14]] Canonne et al showed that for some unspecified k, 
there is a partial permutation-invariant function with communication at most k under per¬ 
fect sharing of randomness, but with communication at least fl(2 fc ) under imperfect sharing 
of randomness. Can their separation be made to hold for k = 0(loglogn)? Answering this 
question would shed some light on the possibility of proving an analogue of Theorem 1.3 
for partial permutation-invariant functions. 
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